AGARD-R-787 


ADVISORY  GROUP  FOR  AEROSPACE  RESEARCH  &  DEVELOPMENT 

7  RUE  ANCELLE  92200  MEU1LLY  $UR  SEINE  FRANCE 


AGARD  REPORT  787 

Special  Course 
on 

Unstructured  Grid  Methods 


OTIC 

ELECTE  I 
JUL2  9  1992  | 


tor  Aavection  uesitmaiea  r 


imiTj 

J1VT7C 


(Les  Methodes  Utilisant  un  Maillage  non  Structure  pour 
l’Etude  des  Hcoulements  Caracterises  par  1’Advection) 


The  material  assembled  in  this  report  was  prepared  and  presented  under  the 
combined  sponsorship  of  the  AGARD  Fluid  Dynamics  Panel,  the  Consultant 
and  Exchange  Programme  of  AGARD,  and  the  von  Karrran  Institute  (VKl) 
for  Fluid  Dynamics.  It  was  presented  in  art  AGARD-FDP-VKi  Special  Course 
at  the  VKl,  Rkode-Saint-  Genese,  Belgium,  2nd— 6th  March  1992 and  at  the 
NASA  Arnes  Research  Center,  Moffett  Field,  California,  United  States, 

28th  September— 2nd  October  1992 . 


AGARD-R-787 


ADVISORY  GROUP  FOR  AEROSPACE  RESEARCH  &  DEVELOPMENT 

7  RUE  ANCELLE  92200  NEUILLY  SUR  SEINE  FRANCE 


AGARD  REPORT  787 

Special  Course 
on 

Unstructured  Grid  Methods 
for  Advection  Dominated  Flows 

(Les  Methodcs  Utilisant  un  Mnillage  non  Structure 
pour  I'Etude  des  Ecoulcments  Caracterises  par  1'Advection) 


Accesion  For  /  ! 

NTIS  CRA&I 

- 

DTIC  TAB 

[  1 

Ummnouccod 

[  j 

Justification 

by 

Distribution/ 

Dist 


/H 


Avail  a.id  ( or 
SpvCiul 


Availability  Codes 


The  material  assembled  in  this  report  was  prepared  and  presented  under  the 
combined  sponsorship  of  the  AGARD  Fluid  Dynamics  Panel,  the  Consultant  v 
and  Exchange  Programme  of  AGARD,  and  the  von  Karman  Institute  (VK1) 
for  Fluid  Dynamics.  It  was  presented  in  an  AGARD-FDP-VKI  Special  Course 
at  the  VK1,  Rhode-Saint-Gencsc,  Belgium,  2nd— 6th  March  1992  and  at  the 
NASA  Ames  Research  Center,  Moffett  Field,  California,  United  States 
28tn  September-  -2nd  October  1992. 


North  Atlantic  Treaty  Organization 
Organisation  du  Traits  de  I'Atlantique  Nord 


% 

v> 


4 


Q  gp —20375 

vJ  C -  .....  um  mil  IHI 


V 

V. 


Best 

Available 

Copy 


The  Mission  of  AGARD 


According  to  its  Charter,  the  mission  of  AGARD  is  to  bring  together  the  lending  personalities  of  the  NATO  nations  in  the  fields 
of  science  and  technology  relating  to  aerospace  for  the  following  purposes: 

—  Recommending  effective  ways  for  the  member  nations  to  use  their  research  and  development  ct  liabilities  for  the 
common  benefit  of  the  NATO  community: 

—  Providing  scientific  and  technical  advice  and  assistance  to  the  Military  Committee  in  the  field  of  aerospace  research  and 
development  (with  partietdar  regard  to  its  military’  application); 

—  Continuously  stimulating  advances  in  the  aerospace  sciences  relevant  to  strengthening  the  common  defence  posture; 

—  Improving  the  co-operation  among  member  nations  in  aerospace  research  and  development; 

—  Exchange  of  scientific  and  technical  information; 

—  Providing  assistance  to  member  nations  for  the  purpose  of  increasing  their  scientific  and  technical  potential: 

—  Rendering  scientific  and  technical  assi‘  tancc.  as  luiucstcd,  to  other  NATO  bodies  and  to  member  nations  in  connection 
with  research  and  development  problems  in  the  aerospace  field. 

The  highest  authority  within  AGARD  is  the  National  Delegates  Board  consisting  of  officially  appointed  senior  representatives 
from  cacti  member  nation.  The  mission  of  AGARD  is  carried  out  through  the  Panels  which  arc  composed  of  experts  appointed 
try  the  National  Delegates,  the  Consultant  and  Exchange  Programme  and  the  Aerospace  Applications  Studies  Programme.  The 
results  of  ACLARl)  work  are  reported  to  the  member  nations  and  the  NATO  Authorities  through  the  AGARD  series  of 
publications  ol  which  this  is  one. 

Participation  in  ACiARD  activities  is  by  invitation  only  and  is  normally  limited  to  citizens  of  the  NATO  nations. 


The  content  of  this  publication  has  been  reproduced 
directly  trom  material  supplied  by  .AGARD  or  the  authors. 


Published  May  1992 

Copyright  ©  AGARD  1992 
All  Rights  Reserved 

ISBN  92-835  0671-5 


Printed  by  Specialised  Printing  Service.',  Limited 
■WCbigwell Lone,  l.oughton,  Essex  Ki I0.il Z 


Recent  Publications  of 
the  Fluid  Dynamics  Panel 


AGARDOGRAPHS  <AG) 

Design  and  Testing  of  High-Performance  Parachutes 
AGARD  AG-3 19,  November  1 99 1 

Experimental  Techniques  in  the  Field  of  Low  Density  Aerodynamics 
AGARD  AG-318  (E),  April  1991 

Techniques  Experimentalcs  Tices  a  TAerodynamhpie  a  Basse  Densite 
AGARl)  AG-3 1 8  (I'R).  April  1 990 

A  Survey  of  Measurements  and  Measuring  Techniques  in  Rapidly  Distorted  Compressible  Turbulent  Boundary  Layers 

AGARD  AG-315,  May  i  989 

Reynolds  Number  Effects  in  Transonic  Flows 

AGARD  AG-303.  December  1988 


REPORTS  (R) 

Skin  Friction  Drag  Reduction 

AGARD  R-786,  Special  Course  Notes.  March  1992 

Engineering  Methods  in  Aerodynamic  Analysis  and  Design  of  Aircraft 
AGARD  R-783,  Special  Course  Notes,  January  1992 

Aircraft  Dynamics  at  High  Angles  of  Attack:  Experiments  and  Modelling 

AGARD  R-776,  Special  Course  Nows,  March  i99i 

Inverse  Methods  in  Airfoil  Design  for  Aeronautical  and  Turoomachincry  Applications 

AGARD  R-780.  Special  Course  Notes,  November  1 990 

Aerodynamics  of  Rotorcraft 

AGARD  R-78 1,  Special  Course  Notes,  November  1990 

ADVISORY  REPORTS  (AR) 

Air  Intakes  for  High  Speed  Vehicles 

AGARD  AR-270,  Report  of  WG  1 3,  September  1 99 1 

Appraisal  of  the  Suitability  of  Turbulence  Models  in  Flow  Calculations 
AGARD  AR-29 1 ,  Technical  Status  Review.  July  1 99 1 

Rotary-Balance  Testing  for  Aircraft  Dynamics 

AGARD  AR-265,  Report  of  WG  II, December  1990 

Calculation  of  31)  Separated  Turbulent  Flows  in  Boundary  Layer  Limit 

AGARD  AR-2  5  5 ,  Report  of  W G 1 0,  May  1 990 

Adaptive  Wind  Tunnel  Walls:  Technology  and  Applications 

AGARD  AR-269,  Report  of  WG  1 2,  April  1990 

CONFERENC  E  PROCEEDINGS  (CP) 

Effects  of  Adverse  Weather  on  Aerodynamics 
AGARD  CP-496.  December  1 991 

Manoeuvring  Aerodynamics 

AGARD  CP  497.  November  1991 

Vortex  Flow  Aerodynamics 

AGARD  CP-494,  July  1991 

Missile  Aerodynamics 
AGARD  CP-493,  October  1 990 


Aerodynamics  of  Combat  Aircraft  Controls  and  of  Ground  Effects 
AGARD  CP-465,  April  1990 

Computational  Methods  for  Aerodynamic  Design  (Inverse)  and  Optimisation 
AGARD-CP-463.  March  1990 

'  Applications  of  Mesh  Generation  to  Complex  3-D  Configurations 
AC.ARD  CP-464.  March  1 990 

Fluid  Dynamics  of  Three-Dimensional  Turbulent  Shear  Flows  and  Transition 
AGARD  CP-438.  April  1989 

Validation  of  Computational  Fluid  Dynamics 
AGARD  CP-437,  December  1988 

Aerodynamic  Data  Accuracy  and  Quality:  Requirements  and  Capabilities  in  Wind  Tunnel  Testing 
AGARD  CP-429.  July  1988 

Aerodynamics  of  Hypersonic  Lifting  Vehicles 
AC.ARD  CP-428.  November  1 987 

Acrodvnamie  and  Related  Hydrodynamic  Studies  Using  Water  Facilities 
AC.ARD  CP-4 1 3,  June  1 987 

Applications  of  Computational  Fluid  Dynamics  in  Aeronautics 

AGARD  CP-4 1 2,  November  1 986 

Store  Airframe  Aerodvnainics 
AGARD  CP-389.  August  1986 

Unsteady  Aerodynamics  —  Fundamentals  and  Applications  to  Aircraft  Dynamics 

AGARD  CP-386,  November  1985 

Aerodynamics  and  Acoustics  of  Propellers 

AGARD  CP-366,  February  1 985 

Improvement  of  Acrodvnamie  Performance  through  Boundary  layer  Control  and  High  Lift  Systems 
AGARD  CP- 365,  August  1984 

Wind  'Funnels  and  Testing  Techniques 

AGARD  CP-348,  February  1 984 

Aerodynamics  of  Vortical  Type  Flows  in  Three  Dimensions 

AC.ARD  CP-342,  July  1 983 

Missile  Aerodynamics 
AGARD  CP-336.  February  1 983 

Prediction  of  Aerodynamic  Loads  oil  Kolorcraft 

AGARD  CP-334,  September  1 982 

Wall  Interference  in  Wind  Tunnels 

AGARD  CP-335.  September  1 982 

Fluid  Dynamics  of  Jets  with  Applications  to  V/STOI. 

AGARD  CP- 308.  January  1 982 

Aerodynamics  of  Power  Plant  Installation 

AGARD  CP- 30 1 ,  September  J  98 1 

Computation  of  Viscous-lnviscid  Interactions 
AGARD  CP-29 1 .  February  1981 

Subsonic/T  ransonic  Configuration  Aerodynamics 
AGARD  CP-285,  September  1 980 

Turbulent  Boundary  layers  Experiments,  Theory  and  Modelling 
AGARl)  CP-271, January  1980 

Aerodynamic  Characteristics  of  Controls 
AGARD  CP-262,  September  1979 

High  Angle  of  Attack  Aerodynamics 

AGARD  CP-247,  January  1 979 


Foreword 


The  objective  of  the  course  am!  the  course  notes  is  to  present  the  state-of-the-itri,  its  well  as  recent  developments  in 
unstructured  grid  methods,  suitable  for  the  computation  of  high  Reynolds  number  compiessihio  and  incompressible  flows,  and 
other  related  subjects.  Topics  and  methods  covered  include: 

—  Least  Squares  Galcrkin  and  Streamline  Diffusion  Finite  Element  Methods 

—  Finite  Volume  Methods  and  Higher  Order  Polynomial  Reconstruction 

—  Essentially  Non  Oscillatory  Schemes  for  Unstructured  Grids  ; 

—  Multidimensional  Upwind  Schemes  on  Triangles  and  Tetrahcdra 

--  Grid  Generation  Methods  for  Unstructured  Grids  Using  the  Frontal  Method  and  Delaunay  Principle 

—  Turbulence  Modelling  on  Unstructured  Grids 

—  Error  Estimators  and  Solution  Adaptivity 

—  Parallel  Computing  on  Unstructured  Grid 

—  Post  Processing  Unstructured  Grid  Data  Bases  for  Flow  Visualization  Analysis 

A  wiue  range  of  applications  is  presented,  going  from  incompressible  free  surface  problems  transonic  aerodynamics  and 
hypersonic  reentry  flows. 

Getting  these  notes  prepared  in  time  was  a  difficult  task.  Only  those  who  have  been  in  this  situation  realize  the  time  and  work  it 
needs  to  write  such  lectures.  Wc  wish  to  congratulate  till  lecturers,  who  in  between  their  busy  professional  activities  have  found, 
or  better  made,  the  time  to  write  detailed  and  high  quality  contributions. 

It  is  our  opinion  that  we  have  here  a  reference  work  which  will  be  used  by  a  whole  generation  of  PhDs  am!  other  researchers  who 
wish  to  jump  into  the  absorbing  subject  of  unstructured  grid  methods  in  CFD. 

We  are-  convinced  that  ’he  lecture  series  for  which  these  notes  have  been  made  will  be  a  great  success,  and  we  thank  both  our 
institutions,  the  von  Karnian  Institute  and  the  NASA  Ames  Research  Center  for  hosting  the  course.  We  thank  AGARD  and  the 
Fluid  Dynamics  Panel,  its  past  Chairman  Jim  McCroskey,  Executive  Winston  Goodrich  and  Secretary  Anne-Marie  Rivault  for 
their  encouragement  and  professional  support. 


Herman  Dcconinck  and  Tint  Barth 
Course  Directors 


Avant-Propos 


L'objct  dc  ce  cours  et  du  support  de  fours  cst  de  presenter  fetal  do  I'art.  aittsi  i|ue  certains  dcvcluppcinciits  ivcents  dans  le 
domainc  ties  methodes  on  maillngcs  non-structures  pour  le  catcut  d’eeoulcmcnts  etimprcssiblcs  et  non-comprossibles  a  grand 
nontbre  dc  Reynolds  et  d'autres  sujets  connexes.  Lcs  questions  et  les  methodes  couvertes  sont  les  sui viintcs: 

—  methodes  ties  elements  finis  moindres  carres  Galerkin  et  a  diffusion  longitudinalc 

—  methodes  do  volumes  finis  et  reconstruction  polynomiale  d'ordre  eleve 

—  methodes  essentiellement  non-oscillatoires  (tour  maillagcs  non-structures 

—  schemas  multidiniensionuels  de  discretisation  dcccntrcc  sur  triangles  ct  tenacities 

—  methodes  de  generation  de  tnaillagcs  pour  grilles  non  structurccs  scion  la  methode  frontnlc  ct  le  principc  de  Delaunay 

—  tuodelisation  tic  la  turbulence  sur  tnaillagcs  non-structures 

—  cstimalcurs  d'erreur  ct  algoritluncs  pour  maillagcs  ndnptatifs 

—  calcul  en  parallele  sur  maillagcs  non-structures 

—  |K>st-traiiemeut  ties  bases  tie  donnees  sur  maillagcs  non  structures  pour  la  visualisation  ties  econlements. 

Un  large  evcntail  d’applications  est  presente,  allant  ties  problcmcx  tie  surfaces  libres  I'tii-compressibles  a  raerodyiuaniquc 
transsonique  et  les  ecoulemems  hypersoniques  rentrants. 

La  ntise  cn  forme  de  ce  support  de  cours  dans  les  delais  impartis  s'est  averce  difficile.  Seals  ccux  qui  out  vctu  unc  tell?  situation 
seront  conscients  du  temps  et  ties  efforts  qu'il  faut  consacrer  a  la  redaction  de  tels  cours.  Ntms  tenons  done,  a  feliciter  tous  nos 
confercncicrs,  qui,  malgrc  ties  cmplois  tie  temps  tres  charges,  out  trouve.  ou  plutdt  creo,  le  temps  nccessaire  pour  eerire  ties 
textes  detailles  tie  grande  quulitc. 

A  no're  avis,  il  s'agit  d’on  veritable  ouvrage  tie  reference  qui  trouvera  un  large  itccueil  aupres  tie  loute  une  generation  tie 
tliplbmcs  troisienic  cycle  et  d'autres  chereheurs  souhuitant  tleeouvrir  le  sujet  passionmmt  ties  methodes  ties  maillagcs  non- 
structures  en  acrodynainiquc  numcriquc. 

Nous  sommes  eonvaincus  qvic  le  cycle  de  conferences  pour  lequel  ecs  cours  out  etc  ecrits  aura  beaucoup  tie  succes.  et  nous 
tenons  a  remereier  nos  deux  organismes,  I'liistiuit  von  Kantian  et  le  NASA  Ames  Research  Center  pour  ('organisation  du  cours. 
Nous  remereions  egalement  1‘AGARD  et  le  Panel  tie  la  dyuumique  ties  tinkles,  son  ancicn  President  Jim  McCroskey,  son 
Atiininisiratcur  Winston  Goodrich  el  sa  Secretaire  Anne-Marie  Rivault  pour  leurs  encouragements  et  lour  soutien 
professionnel. 


I  lerman  Dcconinck  et  Tint  Barth 
Dirceteurs  tie  cours 
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0.  INTRODUCTION 

0.1.  The  SD-method 
The  purpose  of  this  note  is  to  give  an 
overview  of  the  Streamline  Diffusion 
method  (SD -method  below),  also  referred 
to  as  Galerkin/Lcast  Squares  or  SUPG 
(Streamline  Upwind  Pctrov-Galerkin),  as  a 
general  finite  clement  method  for 
hyperbolic  type  partial  differential 
equations  modelling  convection-diffusion, 
compressible/  incompressible  fluid  flow  or 
wave  propagation.  The  SD-method, 
developed  by  'loin  Hughes  and  the  author 
together  with  co-workers  during  the 
eight  ies,  gives  the  first  general  solution  to 
the  fundamental  problem  of  constructing 
finite  element  methods  for  hyperbolic 
problems  with  the  desired  combination  of 
good  stability  and  high  accuracy.  The 
SD-method  is  a  modified  Galerkin  method 
based  on  piecewise  polynomial 
approximation  with  the  following  two  basic 
modifications: 

(0. 1 )  a  "streamline  diffusion"  modification 
of  the  test  functions  giving  a  weighted  least 
squares  control  of  tire  residual  R(U)  of  the 
finite  element  solution  U. 

(0.2)  introduction  of  an  artificial  viscosity 

i  of  typically  the  form 

(  -s  max(c,  Ch| R(IJ) I / |VU|,  Ch3/2)  or 

c  »  inax(c ,  Gh2|R.(U)|/lU[,  Oh3/2), 
where  li  is  the  local  mesn  size  and  <  is 
the  given  viscosity  and  C  denotes  positive 
constants. 

Further,  the  SD-method  is  characterized 

by 


(0.3)  consistent  use  of  space  time  finite 
element  discretization  for  time  dependent 
problems,  with  the  basis  functions  being 
discontinuous  in  time  and  continuous  (or 
discontinuous)  in  space. 

Each  of  the  modifications  (0.1)  and  (0.2) 
increases  the  stability  of  the  underlying 
Galcrkin  method  in  different  ways  and 
through  different  mechanisms.  The  first 
modification  (0.1)  gives  control  of  the 
residual  R(U)  of  the  finite  clement 
solution  U,  obtained  by  inserting  the  finite 
element  solution  into  the  exact  differential 
equation,  in  a  weighted  h9-norm  with 

weight  proportional  to  h- .  The  second 
modification  (0.2)  gives  I^-eontrol  of  all 

first  derivatives  of  U  with  the  weight  c*. 
The  two  modifications  (0.1)  and  (0.2) 
together  add  sufficient  stability  to 
guarantee  e.g.  the  following  important 
properties  of  the  SD-method  (when 
applicable):  maximum  norm  stability, 
entropy  consistency,  error  localization  and 
monotone  shock  resolution.  Further,  the 
modifications  play  a  crucial  role  in  the 
adaptive  SD~ methods  bast'd  on  a  posteriori 
error  estimates  developed  recently.  We 
recall  that  the  finite  element  in  its  basic 
form,  i.e.  the  standard  Galcrkin  method 
with  piecewise  polynomial  approximation, 
which  has  been  so  remarkably  successful  for 
elliptic  and  parabolic  problems,  dot's  not 
work  in  general  for  hyperbolic  problems: 
Unless  the  exact  solution  is  globally  smooth 
(or  the  mesh  sufficient  ly  refined 
everywhere)  the  standard  Galerkin  finite 
element  solution  will  contain  large  spurious 
oscillations  making  the  error  large  over 


large  nm  lions  of  space  ami  lime.  The 
spurious  oscillations  reflect  the  lack  of 
;  ,,ij)ili».y  in  the  standard  (ialerkin  method 
concerning  the  residual  and  derivatives  of 
the  finite  element  solution. 

The  spare- -time  discretization  (0.3)  of  the 
SO -method  offers  a  great  flexibility  in  the 
discretization,  in  particular  through  the 
possibility  of  using  space-t  ime  meshes 
oriented  in  space-time.  This  gives  a  happy 
marriage  of  Kulerian  and  bagrangean 
descriptions  in  the  SD- method  combining 
the  advantages  of  each  of  the  two 
approaches  while  avoiding  the  dis¬ 
advantages  of  a  full  Euclidean  or 
Lagrangcan  Approach.  I'slng  space- time 
meshes  oriented  according  to  characteristics 
(or  particle  paths),  we  obtain  a  special 
variant  of  the  SD-method,  which  we  refer 
to  as  the  Characteristic  SD-method,  or 
( 'SD-method  for  short,  and  which  is  of 
particular  interest  for  incompressible  flow 
including  also  free  (or  moving)  boundaries. 
Orienting  the  modi  in  space-time  according 
to  the  nature  of  the  solution,  we  obtain 
variants  of  the  SD-method  with  features  of 
shook  -fit.?  mg  methods. 

The  SD-method  with  basically  the  first 
modification  (fl.  1)  was  introduced  in  the 
iH'giiming  of  tlu1  eight  ies  by  Hughes  and 
Hrooks  jilllij  for  stationary  convection 
diffusion  problems.  The  theoretical  analysis 
of  t  he  method  in  this  form  including  also 
extensions  to  time- dependent  problems 
basal  on  space-time  finite  elements,  was 
carried  out  in  the  early  eighties  bv 
Johnson.  N avert  and  Pitkaranla  [,INj,  (N). 
(.IN Pi.  A  main  result  of  this  analysis  was 
to  exhibit  the  role  of  the  modificat  ion  (0.1) 
to  increase  the  stability  of  the  Oalerkin 
method,  To  emphasize  t  he  stability  aspec  t, 
the  term  "streamline  diffusion"  directly 
related  to  stability  was  used  (instead  of 
streamline  upwind  (IIB2)),  with  motivation 
from  scalar  convection  problems  where  the 
least  squares  modification  giving  weighted 
L.j  control  of  the  convective  derivative  of 

the  finite  element  solution  corresponds  to 
introducing  a  diffusion  term  acting  in  the 
direction  of  the  streamlines. 

Extensions  of  the  SD-method  to  the 
incompressible  Navior-Stokos  equations 
followed  quickly  ({HB2],  (JSj)  with  further 
developments  in  recent  years  to  a  simple 


equal  order  velocity -pressure  formulation, 
which  has  found  wide  applicat  ions 
beMuding  free  boundary  flow  ([IIS],  [113]). 

A  decisive  step  in  the  development  of  the 
SD-method  was  taken  in  the  mid  eighties 
with  the  introduct  ion  of  the  second 
modificat  ion  (0.2)  cqieiiing  in  particular  the 
possibility  of  applying  the  SD-method  to 
compressible  flow  including  shocks  ([Jll-'ll, 
(JS/.lj,  IJSzll).  It  appears  that  the  residual 

based  artificial  viscosity  of  the  form  c  - 
Ch  ( lt(U)  |  / 1 VU  | ,  which  has  a  mnv  con¬ 
struction  as  compared  to  artificial 
viscosities  used  in  finite  difference/ volume 
methods,  is  close  to  the  minimal  viscosity 
required  to  make  a  (Ialerkin  method 
(including  also  (0.1))  work  well  for  e  g. 
problems  with  shocks. 

As  indicated,  space-time  finite  elements 
were  introduced  early  in  the  SD-method. 
((.INP],  |Nj),  but  the  full  exploitation  of  the 
generality  of  tne  space-t  ime  mesh  was 
initiated  later  (j.ljj,  (J4j,  j.Kij,  (Ha2-3j)  in 
the  form  of  the  OSD-mctnoa  with  applica¬ 
tions  to  free  boundary  flow  ([Had]).  The 
idea  of  using  space-time  finite  elements 
was  taken  up  in  (JShj,  JShllJj)  and  recently 
also  in  (('riJll)  with  applications  to  free 
lioundary  flow.  Another  recent  develop¬ 
ment  concerns  adaptive  SD  and  CSD- 
methods,  (see  (le.lT-8),  [.I2j,  {JSzJj,  ( 1 1 J ] , 
(Jilt),  where  in  a  new  way  the  basic 
modifications  (0.1)  and  (0.2)  come  to  use 
e.g.  lo  prove  a  (Kisterioii  error  estimates 
underlying  the  adaptive  procedures.  As  far 
as  we  know  these  results  are  the  first  to 
show  that  efficient  and  reliable  error 
control  is  possible  for  hyperbolic  problems. 

The  SD-method  has  now  successfully  been 
applied  to,  in  particular,  stationary  and 
time  dejHnident 

(i)  convection  diffusion  problems. 

(ii)  incompressible  Euler  and 
Navior  -Stokes  equations, 

(iii)  compressible  Euler  and 
Na vier - S tokos  equations. 

(iv)  reactive  compressible  flow, 

(v)  second  order  wave  equat  ions  ((.15), 

I'M). 

A  large  number  of  and  numerical 
theoretical  results  are  availabo.  The 


l-> 


theoretical  results  in  general  go  beyond 
previous  results  for  other  methods 
(including  finite  differonce/finitc  volume 
methods)  in  generality  atui/or  precision. 
Some  basic  features  of  the  Sl)-met.hod 
supported  by  theoretical  and  numerical 
evidence  arc  as  follows: 

(0.1)  High  accuracy.  <?(h*)+’)for 
smooth  solutions  with 
polynomials  of  degree  p. 

(0,5)  Good  stability:  sharp  monotone 

resolution  of  shocks  and  contact 
discontinuities. 

(0.6)  localization:  quick  decay  in 

upwind  or  crosswind  propagation 
in  scalar  convection  problems. 
(0.7)  Adaptive  forms  available  based 

on  sharp  a  posteriori  error 
estimates. 

(0,8)  Limits  of  SD-solutions  satisfy  all 

entropy  conditions  (entropy 
consistency). 

(0.0)  Convergence  for  scalar 

conservation  laws  in  several 
dimensions  with  piecewise 
polynomials  of  arbitrary  degree 
on  general  meshes. 
i.0. 1 0 )  Problems  with  free  or 

moving  boundaries  may  Ik* 
approached  in  anatural  way  using 
the  flexibility  of  the  space-lime 
mesh. 

0.1*.  Comparison  with  other  methods  in 
CM). 

We  now  briefly  compare  Lie  SI)- method 
with  other  numerical  methods  in 
Computational  Fluid  Dynamics  CFI)  such 
as  (a)  finite  difference  methods,  (b)  finite 
volume  methods,  (v)  particle  methods  and 
(d)  shock-fitting  methods. 

In  each  case  (a)  -(d)  there  is  a  closely 
related  variant  of  the  SD-mcthod  realizing 
the  essential  feature  of  the  method  (a)-(d) 
in  a  full  finite  element  context. 

(a)  Finite  difference  methods 

The  classical  finite  difference  method  for 
fluid  flow  is  obtained  by  modifying  a 
centered  difference  approximation  of 
convective  terms  by  adding  a  combination 
of  artificial  viscosity  of  first  order 

ChA^U  ami  third  order  Ch^A^U  where 


Aj(  and  Aj^  are  discrete  analogs  of  the 

l.aplacian  ami  hilaplaeian,  respectively, 
with  a  switch  from  first  order  to  third  order 
in  regions  of  smoothness.  Now,  the 
Standard  Galcrkin  finite  element,  method 
typically  produces  centered  difference 
approximations  to  convective  terms  and 
the  modificar  w  (0.1)  and  (ii.lt)  add 
artificial  visa  ,ty  in  non-standard  forms 
including  in  particular  an  automatic 
"switch''  related  to  the  size  of  the 

coefficient,  c  =  Ch  1 11(D)  |  / 1 VI I  j ,  which  is 
typically  of  order  <?(h)  close  to  shocks  for 

«) 

example  and  of  order  0(\i )  or  smaller  in 
regions  of  smoothness  Tints,  the 
SI) -method  contains  basic  features  of 
centered  difference  schemes  with  artificial 
viscosity,  but.  the  technicalities  are 
different  in  the  SD-mcthod.  In  particular, 
the  SD-mcthod  avoids  the  use  of  fourth 
order  dissipation  and  realizes  the  "switch" 
in  viscosity  in  a  full  finite  element  context 
which  is  easy  to  implement  and  accessible 
for  analysis. 

( b )  Finite  volume  methods 

The  SD-mcthod  may  for  non-viscous  flows 
Im  used  with  discontinuous  approximat  ion 
in  space.  We  refer  to  this  variant  of  the 
SD-method,  including  the  modifications 
(0.1)  and  (0.2)  properly  interpreted,  as  the 
Discontinuous  Galcrkin  or  DG -method. 
With  piecewise  constant  approximation 
this  leads  to  more  or  less  classical  finite 
volume  methods  dettending  on  the  context. 
With  higher  order  discontinuous 
I>olynomial  approximation  the  IXJ-meihod 
may  Iw  viewed  as  a  higher  order  finite 
volume  method.  Thus  the  1)G-  method 
gives  the  natural  formulation  of  higher 
order  finite  volume  methods,  which  have 
not-  found  a  really  satisfactory  formulation 
within  the  classical  context  of  finite  volume 
methods  involving  ad  hoc  flux  limiting,  flux 
correction,  post  processing  et.rot..  For  an 
analysis  of  the  DG -method  for  conservation 
laws,  six*  [J.l], 

(c)  Particle  methods 

With  space-time  meshes  oriented  according 
to  particle  paths  the  SD-mcthod  may  be 
viewed  as  an  approximate  part  icle  met  hod 
with  restart,  or  a  variant,  of  a  method  of 
the  form  "exact  transport  t  projection". 
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However,  the  SD-method  does  not  require 
exact  transport  or  precise  particle  tracing. 
The  mechanism  is  just  the  orientation  of 
the  space-time  mesh  in  space-time  which 
brings  in  a  "particle  feature"  to  the 
method,  but  precise  orientation  or  "exact 
transport"  is  not  required.  The  method  will 
work  also  on  non-oriented  standard 
meshes,  but  the  precision  is  improved  by 
suitable  approximate  mesh  orientation. 
Further,  the  SD-methoa  offers  a  built  in 
modified  I^-projection  entering  when  the 

space  mesh  is  changed  at  discrete  time 
levels.  This  makes  it  possible  to  change  the 
space  mesh  often  without  decreasing  the 
accuracy  by  introducing  oscillations  or  too 
much  dissipation.  Thus,  the  SD-method 
with  space-time  meshes  oriented  according 
to  particle  paths  gives  a  general 
formulation  of  a  "particle  method"  with  the 
following  advantages:  precise  particle 
tracing  is  not  necessary  and  frequent 
restarts  with  remeshing  in  space  are 
possible  without  essentially  decreasing  the 
accurary. 

(d)  Shock-fitting  methods 

Finally,  orienting  the  space-time  mesh 
according  to  solution  features  such  as 
shocks  rather  than  particle  paths,  which 
can  naturally  be  realized  in  adaptive  forms 
of  the  SD-method,  we  make  contact  with 
so  called  shock-fitting  methods.  In  the 
SD-method  the  mechanism  is  again  only 
the  mesh  orientation  which  may  Vpically 
be  obtained  from  derivative  information 
from  the  computed  solution. 

To  sum  up,  the  SD-method  characterized 
by  (0.1)-(0.3)  offers  a  large  degree  of 
generality  and  flexibility  and  may  be 
viewed  as  giving  very  natural 
generalizations  of  all  the  classical 
techniques  of  CFD  including  finite 
difference,  finite  volume,  particle  and  shock 
fitting  methods.  Thus,  the  SD-method 
brings  a  surprising  degree  of  unity  to  CFD 
combining  in  particular  the  two  worlds  of 
Euclidean  and  Lagrangean  methods, 
everything  realized  by  a  modifieo  Galerkin 
approach  with  piecewise  polynomials  on 
space-time  meshes. 


0.3.  The  stability  conoept.  Artificial 
viscosity. 

A  main  result  of  the  theoretical  analysis  of 
the  SD-method  is  to  exhibit  the 
importance  of  a  correct  notion  of  stability 
both  for  stationary  and  time  dependent 
problems.  The  classical  stability  concept 
for  hyperbolic  problems  typically  requiring 
in  a  time  dependent  problem  an  Lg-bouna 

of  the  solution  for  positive  time  in  terms  of 
the  L^-norm  of  initial  data  at  t  =  0,  or  in 

the  case  of  a  stationary  problem  an 
Lg- bound  of  the  solution  in  terms  of  the 

L^-norm  of  the  right-hand  side,  turns  out 

not  to  be  fully  adequate.  Instead,  a  new 
stability  concept  for  the  discrete  problem 
involving  in  addition  weighted  L^-control 

of  the  residual  (through  the  streamline 
diffusion  modification  (0.1))  and  first 
derivatives  of  the  discrete  solutions 
(through  the  artificial  viscosity  (0.2))  in 
terms  of  e.g.  the  L^-norm  of  the  initial 

data,  turns  out  to  be  more  appropriate  and 
useful.  In  addition  a  new  stability  concept 
for  linearized  versions  of  the  continuous 
problem  is  used  in  the  context  of  deriving  a 
posteriori  error  estimates  underlying  the 
adaptive  versions  or  the  SD-method.  Thus, 
we  emphasize  the  extreme  importance  of 
using  relevant  stability  concepts  for  both 
the  discrete  and  continuous  problems  and 
the  new  aspects  of  this  fundamental 
problem  brought  to  the  light  through  the 
analysis  of  the  SD-method. 

0.4.  Adaptive  SD-methods.  A  posteriori 
and  a  priori  error  estimates. 

The  improved  stability  properties  of  the 
SD-method  in  particular  make  it  possible 
to  prove  sharp  a  posteriori  error  estimates 
which  may  be  used  to  design  reliable  and 
efficient  adaptive  variants  of  the 
SD-method.  This  seems  to  open  for  the 
first  time  the  field  of  CFD  to  adaptive 
quantitative  error  control  on  mathematical 
basis  with  accurate  resolution  of  fine  scale 
features  in  e  g.  boundary  layers  and  shocks. 

The  a  posteriori  error  estimates  for  the 
SD-method  for  hyperbolic  flow  problems 
typically  have  the  form: 
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(0.10)  ||e|lL  <  C||h2e-1R(U)||I 
Li2  ,j2 

<  C||min(h|VU|,h’R)jj,  , 
l2 

where  e  is  the  error,  R(U)  is  the  residual 
of  the  computed  solution  U,  h  is  the  mesh 

size  and  we  used  the  definition  of  e  .  This 
estimate  should  be  compared  to  the  typicai 
corresponding  estimate  for  the  Standard 
Galerkin  method  for  hyperbolic  problems: 

(0.11)  ||e||L  <  C|jR||L  , 

l2  l2 

and  the  standard  Galerkin  method  for 
elliptic  problems: 

(0.12)  Hell,  <C||h2R||.  . 

lj2  lj2 

The  estimate  (0.1 1)  is  in  general  useless  for 
adaptive  purposes  since  the  right  hand  side 
of  (0.11)  will  increase  with  decreasing  mesh 
size  until  all  features  have  been  resolved. 
The  estimate  (0.12)  for  elliptic  problems  is 
sharp  and  may  be  used  as  the  basis  for 
reliable  and  efficient  adaptive  algorithms. 
Clearly  (0.10)  is  a  mixture  of  (0.11)  and 
(0.12)  using  in  particular  the  ellipticity  in 
the  SD-method  introduced  through  the 

artificial  viscosity  e,  together  with  the 

particular  design  of  i . 

The  proof  of  the  a  posteriori  error  estimate 
(0.10)  has  the  following  structure: 

1 .  Representation  of  the  error  in  terms  of 
the  residual  of  the  finite  element 
solution  and  the  solution  of  a 
continuous  (linearized)  dual  problem. 

2.  Use  of  the  Galerkin  orthogonality  built 
in  the  finite  element  method. 

3.  interpolation  estimates  for  the  dual 
solution. 

4.  Strong  stability  estimates  for  the 
continuous  dual  problem. 

We  note  the  crucial  roles  played  here  by 
the  residual,  the  Galerkin  orthogonality 
and  the  strong  stability  of  the  continuous 
dual  problem. 

The  typical  a  priori  error  estimate  for  the 
SD-method  with  piecewise  polynomials  of 
degree  p  takes  the  form 


(0.13)  ||e||,  <  C||hP+*L>Pflu||L  . 

lj2  h2 

where  u  is  the  exact  solution  and  D^+*u 
is  the  maximal  modulus  of  derivatives  of  u 
of  order  p+1. 

The  proof  of  the  a  priori  error  estimate 
typically  has  the  following  structure: 

1.  Representation  of  the  error  in  terms  of 
the  exact  solution  and  a  discretized 
dual  problem. 

2.  Use  of  the  Galerkin  orthogonality  to 
introduce  the  truncation  error  in  the 
error  representation. 

3.  Interpolation  estimates  for  the 
truncation  error. 

4.  Strong  stability  of  the  discrete  dual 
problem. 

We  note  the  similarity  in  the  structure  of 
the  proofs  of  the  a  priori  and  a  posteriori 
error  estimates,  and  also  the  differences:  In 
the  a  priori  case  the  key  roles  are  played  by 
the  truncation  error  and  the  strong 
stability  of  the  discrete  problem,  and  in  the 
a  posteriori  case  these  roles  are  taken  by 
the  residual  and  the  stability  of  the 
continuous  problem.  Both  the  a  priori  and 
a  posteriori  error  estimates  are 
fundamental:  The  a  priori  error  estimate 
shows  that  the  discretization  error  (and  the 
residual)  will  tend  to  zero  with  decreasing 
mesh  size,  and  the  a  posteriori  error 
estimate  is  the  basis  for  adaptive 
quantitative  error  control. 

0.5.  Summary  of  the  design  principles  of 
the  SD-method 

We  recall  some  of  the  fundamental 
problems  in  CFD: 

0.141  design  of  artificial  viscosity, 

0.15)  unstructured  meshes, 

(0.16)  how  to  combine  Eulerian  and 
Lagrangean  methods. 

Each  of  these  problems  has  received 
massive  attention  over  the  years,  but 
conclusive  answers  are  still  lacking  to  a 
large  extent  within  the  classical 
methodologies  of  CFD  (finite  difference 
methods,  finite  volume  methods,  particle 
and  shock-fitting  methods).  Our  main 
point  is  now  that  the  SD-method  with  the 
three  cornerstones  (0.1)-(0.3)  offers  a  new 
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approach  to  each  of  the  fundamental 
problems  (0.14)-(0.16). 

The  construction  of  the  artificial  viscosity 

e  in  the  SD -met hod  with  the  dependence 
on  the  residual  has  a  solid  mathematical 
basis  and  appears  to  be  close  to  the 
minimal  artificial  viscosity  required  to 
guarantee  features  such  as  entropy 
consistency,  maximum  norm  stability, 
(almost)  monotone  shock  resolution  et  cet, 
of  prime  interest  in  classical  CFD.  In 

addition,  the  artificial  viscosity  e  of  the 
SD-method  makes  it  possible  to  design 
efficient  adaptive  SD-methods  based  on  a 
posteriori  error  estimates.  Thus,  the 
SD-method  offers  a  solution  of  the 
fundamental  problem  (0.14). 

The  space- time  discretization  of  the 
SD-method  has  a  maximal  flexibility  and 
appears  to  solve  (0.16)  as  well  as  of  course 
(0.15). 

To  sum  up,  it  app(  -s  that  the  finite 
element  method  (^alerkin  +  piecewise 
polynomial  approximation)  with  the  design 
principles  (0. 1  )-(C.3),  that  is  the 
SD-method,  gives  a  fresh  approach  to  the 
fundamental  problems  of  CFD  v  a 
surprising  degree  of  unificatic  i  opens 
many  possibilities  for  the  fut 

0.6.  Outline 

An  outline  of  the  coat  ,  of  this  'He  is  as 
follows: 

1.  SD-methods  i?  scalar  linear 

convection 

2.  SD-methods  for  time  dependent  scalar 
linear  convection-diffusion. 

3.  The  CSD- method  for  scalar  linear 
convection-diffusion. 

4.  The  SD  and  CSD-methods  for  the 
incompressible  Euler  and 
Navier-  Stokes  equations. 

5.  The  SD-method  for  the  compressible 
Euler  equations. 

6.  The  DG -method  for  conservation  laws. 

7.  Adaptive  SD-methods  for  Burgers' 
e  nation. 

8.  Numerical  .esults 

9.  Conclusion.  Prospects  for  the  fut  v.. 


1.  SD-METHODS  FOR  STATIONARY 
LINEAR  SCALAR  CONVECTION 
DIFFUSION 

1.1.  Introduction 

As  a  basic  model  problem  we  shall  consider 
the  following  stationary  linear  scalar 
convection-diffusion  problem:  Find  the 
concentration  u  =  u(x)  such  that 

(1.1a)  /?•  Vu  +  ou  -  div(eVu)  =  f  in  D, 
(1.1b)  u  =  g  on  T, 

where  Cl  is  a  bounded  convex  domain  in 

the  plane  [R^,  p  =  P(x)  =  (/?j(x),  /?2(x))  is 

a  given  velocity  field,  o  =  a(x)  is  an 
absorption  coefficient,  e  =  e(x)  is  a 
positive  viscosity  coefficient,  which  we 
typically  assume  to  be' "small"  in  a  sense  to 
be  made  precise,  and  f  e  L 2(fi)  and  g  c 

L2(r)  is  a  given  production  term  and 

boundary  data.  Note  that  (1.1)  is  the  basic 
model  for  processes  involving 
convection-absorption-diffusion-reaction 
with  a  very  large  range  of  applicability.  All 
coefficients  p,  a  and  e  are  assumed  to  be 
"sufficiently  smooth"  according  to 
requirements  made  more  precise  below.  We 
shall  assume  that 

(1.2)  (-2  /  p  +  a)(x)  >  aQ  >  0, 

where  o()  is  a  positive  constant.  This 

assumption  is  not  essential  in  many  cases 

where  it  is  sufficient  to  assume  (-  2  div  P  + 

o)  to  be  bounded  from  below  (cf.  [JNP]). 

It  e  =  0  then  (1.1)  reduces  to 

(1.3a)  /?•  Vu  +  mi  —  f  in  Cl , 

(1.3b)  u  =  g  on  T  , 

where  now  the  boundary  condition  is  only 
enforced  on  th  inflow  part  r_  =  {x  6  F: 

n(x)-/?(x)  <  0}  of  P,  where  n(x)  is  the 
outward  unit  normal  to  F.  The  solution  u 
of  (1.1)  will  typically  have  an  outflow  layer 
of  width  <?(e)  at  the  outflow  part 
r+  =  {x  £  T:  n(x)*/?(x)  >  0}  of  the 

boundary  and  may  have  internal  layers  of 

wi'-di  0{ft)  along  streamlines  x(t)  given 
by  (i,  i.e.  solutions  of 
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(1.4a)  ^  =  #x)  t  >  0, 

(1.4b)  x(0)  =  x, 

e.g.  if  the  Inflow  data  g  is  discontinuous 
or  if  f  is  discontinuous  across  a  streamline. 
Thus,  the  solution  u  of  (1.1)  typically  has 

features  on  the  different  scales  ^(1),  Oyfi) 
and  0{t).  The  essential  difficulty  in  the 
numerical  solution  of  (1.1)  is  now  the 
presence  of  small  scale  features  in  u,  which 
if  not  resolved  may  lead  to  spurious 
oscillations  in  a  standard  Galerkin 
approach.  Let  us  remark  here,  anticipating 
the  discussion  of  adaptive  SD- met  hods 
below,  that  even  if  an  adaptive  process  may 
lead  to  a  final  mesh  where  all  features  of 
the  exact  solution  are  resolved  so  that  on 
the  final  mesh  standard  Galerkin  would  be 
possible  to  use,  the  meshes  in  the  initial 
stages  of  the  adaptive  process  would  not  be 
fine  enough  to  resolve  all  features,  in  which 
case  we  need  robust  discretization  methods 
like  the  SD-method  which  is  able  to 
produce  good  results  without  requiring 
global  resolution.  The  only  way  to  avoid 
this  would  be  to  use  an  intit.ial  mesh 
refined  uniformly  everywhere  which  is  not 
cost  effective,  or  just  impossible  in  most 
interesting  cases. 

The  basic  problem  is  now  to  design  and 
analyze  a  finite  element  method  tor  (1.1) 
which  is  higher  than  first  order  accurate, 
and  which  has  good  stability  properties  so 
that  resolution  of  all  features  v;i)l  not  be 
necessary  to  produce  reasonable  results;  an 
unresolved  local  feature  should  not  degrade 
the  error  globally  but  only  locally.  We  shall 
see  that  the  SD-method  gives  a  very 
satisfactory  solution  to  this  problem.  Note 
that  if  we  give  up  the  requirement  of 
accuracy  higher  than  first  order,  the 

solution  is  imrnedeate:  Just  take  l  ~ 

C  |  /? |  h  as  artificial  viscosity  and  apply  the 
standard  Galerkin  method.  However,  this 
method  in  general  adds  much  too  much 
articial  viscosity  and  will  require  excessive 
mesh  refinement  to  give  reasonable 
accuracy,  and  thus  cannot  be  considered  to 
give  a  satisfactory  solution  in  general.  Note 
that  this  simple  method  corresponds  to  the 
classical  artificial  viscosity  method  or 
upwind  method  in  finite  difference  theory. 

In  the  formulation  of  the  SD-method  below 
the  given  viscosity  c  will,  if  the  mesh  size 


is  not  small  enough,  be  replaced  by  the 

artificial  viscosity  c  depending  on  the 
computed  solution  U  and  the  mesh  size  h. 
It  is  convenient  (and  natural)  to  split  the 
total  error  e  =  u  -  U  into 

(1.5)  u  -  U  =  (u  -  u)  +  (u  -  U), 

where  u  satisfies  (1.1)  with  e  replaced  by 

c,  i.e.,  u  is  the  solution  of  the  continuous 
problem 

(1.6a)  +  au  -  div(eVu)  =  f  in  0, 

(1.6b)  ii  =  g  on  T. 

Now,  u  -  u  is  the  difference  between  the 
solutions  of  two  continuous  problems  with 

different  viscosity  e  and  e,  and  u  -  U  is 
the  discretization  error  related  to  (1.6)  with 

now  e  considered  to  be  given.  The 
advantages  of  the  splitting  (1.5)  are  as 
follows:  The  estimation  of  the 

discretization  error  u  -  U  will  concern  a 
linear  problem,  whereas  the  full  problem  is 

nonlinear  since  e  depends  on  U.  Further 
in  an  adaptive  approach  the  perturbation 

error  u  -  u  may  be  controled  by 

controlling  the  difference  e  -  e,  cf  below. 

1.2.  Finite  element  prerequisites 
For  simplicity  we  shall  below  restrain  the 
detailed  presentation  to  the  simplest 
possible  finite  elements:  piecewise  linear 
functions  on  triangles  (two  dimensions)  or 
tetrahedrons  (three  dimensions).  This  is 
not  essential  and  all  methods  to  be 
presented  have  natural  generalizations  to 
general  piecewise  polynomial  approxima¬ 
tion. 

Below  f 1  will  denote  a  bounded  domain  in 

K^,  d  =  2,3,  with  polygonal  or  polyhedral 
bundary  I\  By  Tj  =  {K}  we  will  denote 

a  triangulation  of  ft,  i.e.,  a  subdivision  of 
ft  into  triangles  (d  =  2)  or  tetrahedrons 
(d  =  3)  K  such  that  different  elements  K 
which  intersect,  share  either  a  vertex  or  a 
side  (d  =  2)  or  a  vertex,  edge  or  face 
(d  =  3).  The  local  mesh  size  of  will  be 

given  by  the  mesh  function  h(x)  satisfying 
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(1.7)  c^hj^  <  h(x)  <  c2hj<-  for  x  €  K, 

where  and  c2  are  positive  constants 

independent  of  and  li^-  is  the 

diameter  of  K.  We  shall  further  assume 
that  the  smallest  angle  of  the  elements  K  e 
TV  are  uniformly  bounded  below. 

The  basic  finite  element  space  used  below 
will  now  be  the  following;  =  {v:  v  is 


We  further  recall  the  following  inverse 
estimate:  There  is  a  constant  C  such  that 
for  all  v  e 

(L9c)  IIVvIIl2(k) 


<  ch" 1  ||v|i 


WK) 


,  K  e  Th. 


We  shall  below  use  the  following  notation 


continuous  on  Cl  =  0  U  T  and  v  is  linear 
on  K,  VK  €  TjJ,  i.e.  the  space  of  con¬ 
tinuous  piecewise  linear  functions  on  SI. 
By  we  shall  denote  a  standard  inter¬ 
polation  operaioi  into  defined  by 

nodal  interpolation  (or  variants  thereof 
involving  local  smoothing)  or  through 
Lg-  projection.  We  shall  assume  that  jt^ 

satisfies  the  following  interpolation 
est  imates:  There  are  constants  C 
independent  of  T^  such  that 

(l.Sa)  iink(v  -  v))iiL2(f2) 

<  C||hm“kDmv||L  (ft)  0<k<m<2, 

2 

d.8b)  ||v  -  V"l2(*) 


(v,w)  =  /  vw  dx, 

SI 

(Vv,  Vw)  =  /  Vv-Vw  dx, 

SI 

||v||  =  (v,v)\  ||Vv|j  =  (Vv,  Vv)*. 


1.3.  Formulation  of  the  SD-method  for 
(1.1)  and  (1.3). 

To  define  the  SD-method  for  (1.1)  with 

g  =  0  for  simplicity,  let  ^  C  Hq(D)  be 

the  standard  finite  element  space  of 
piecewise  linear  functions  on  a 
triangulation  Tj  =  {K}  of  SI  which 

vanish  on  I\  The  SD-method  for  (1.1) 

now  reads  as  follows:  Find  such 

that 


<  Chm“*||Dmv| 


MK)’ 


1  <  m  <  2,  K  G  T, , 


where  I\  is  the  union  of  K  and  the 
nearest  neighbors  of  K  if  m  =  1,  and 

I\  =  K  if  m  =  2,  and 

Dmv  -  max  D°v, 
j  a |  —  in 

with 

«(j,  using  standard  multiindex  notation. 


(1.10)  a(U,v)  — -  L(v)  VvG^h, 

where 

a(w,v)  = 

=  (/?*Vw  +  ow,  v  +  <!>(/?•  Vv+<*v)) 

+  (cVv,  Vv), 

L(v)  =  (f,  v  +  S{0-\v  +  ov)), 
where 

(1.11a)  6  =  Cl  max(h  -  -j^y  ,  0)/ 1/?| , 
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(1.11b)  e  =  t(h,  R(U))  = 

max(e,  C2h  CjjIi3/2), 

R(U)  =  Rj(U)  +  R2(U), 

(1.11c)  Rj(U)  = 

|/?-VU  +  oU  -  div(cVU)-f|  in  K, 

(l.lld)  R-j(U)|k  - 

max  max  i  I  [c  nQ-VU)i /h.., 
ScflK  S  1  b 

where  [f  ng-VU]  denotes  the  jump  in  the 

quantity  e  iig-VU  across  a  triangle  side  S 
(in  the  interior  of  fl)  with  normal  ng. 

Further,  we  use  the  convention  that 
integrals  over  Vt  containing  second 
derivatives  are  to  he  interpreted  as  a  sum 
of  integrals  over  the  elements  K,  and  we 
also  recall  that  h(x)  ~  h^  for  x  6  K.  The 

constants  Cj  and  C2  only  depend  on  the 

type  of  polynomial  approximation  and  the 
shape  of  the  elements.  For  p  =  1  and 
triangular  elements  one  may  take 
C,  =  C2  ~  0.5.  The  constant  may 

normally  be  chosen  to  be  zero,  but  there  is 
(at  least  in  theory)  a  reason  to  have 
Cg  >  0.  Note  that  with  e  very  small,  we 

have  6  ~  -jjj  and  t  ~  (if  C3  is 

small),  while  6  =  0  for  e  >  h|/?|  and 
t  =  t  if  e  >  max  (C2hR(U)/(|VU|+h), 

C-jli^2).  Notice  further  that  in  principle 

l  is  implicitely  defined  through  (1.11b) 

since  R(U)  depends  on  c  through 
(1.11c).  This  may  be  simplified  in  practice 

by  choosing  €  =  0  in  (l.llc,d). 

Notice  that  (1.10)  is  obtained  multiplying 

(1.1a)  by  £(/TVv+c*v),  where  the  e^-term 
may  be  omitted  in  the  case  of  piecewise 
linear  approximation  in  space,  see  [JNP]. 


Let  us  also  formulate  the  SD-method  for 
the  reduced  problem  (1.3)  as  follows  using 
the  space  of  continuous  piecewise 

linear  functions  on  0  without  any 
boundary  conditions  enforced  and  imposing 
the  inflow  condition  (1.3b)  weakly:  Find 
U  €  such  that 

(1.12)  a0(U,v)  =  L0(v)  Vv  €  Vh, 

where 

ao(wiv)  =  a(w>v)  ~  /  wv/?>nds, 


L0(v)  =  L(v)-/  gv/Tnds. 
u  r 


where  6  and  e  are  defined  by  (1.11)  with 
e  =  0. 

Let  us  note  that  with  =  C2  =  =  0, 

in  (1.10)  and  (1.12),  we  get  the  standard 
Galerkin  method  (with  weakly  imposed 
boundary  conditions  in  the  case  (1.12)), 

while  the  choice  Cj  =  0  and  setting  e  = 

'h  |  /? |  would  give  the  classical  artificial 
viscosity  method  corresponding  to  a  full 
upwind  approximation  of  the  convective 
derivative  (with  piecewise  linears).  In  the 
following  table  we  roughly  summarize  the 
characteristics  of  the  three  methods 
discussed  with  a  plus  or  minus  sign 
indicating  a  satisfactory  quality  or  not. 

Sta-  Accuracy 
bility 

Standard 

Galerkin  -  + 

crcrc3=o 

Classical  art- 

ficial  vis-  + 

cosity 

C^O,  c=h|/?|, 

Streamline 

Diffusion 

CrC2~0.5,  +  + 

Cg  small 
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1.5.  The  basic  stability  estimates  for  the 
SD-method 


1.6.  An  a  priori  error  estimate  for  the 
SD-method 


The  motivation  for  the  two  modifications 
(0.1)  and  (0.2)  characterizing  the 
SD-method  is  to  increase  stability  (without 
sacrificing  accuracy).  Let  us  now  derive 
the  basic  stability  estimates  for  the 
SD-method  (1.10)  and  (1.12).  These 
estimates  will  follow  from  the  following 
coercivity  properties  of  the  bilinear  forms 
a(v,v)  and  aQ(v,v):  There  are  positive 

constants  c  depending  on  a  and  Oq 

such  that 


Let  us  prove  an  a  posteriori  error  estimate 
for  the  SD-method  (1.12).  Without  loosing 
the  essential  part  of  the  argument  we  shall 
then  assume  for  simplicity  that 

t  =  0  and  o=0.  Recalling  (1.3)  we  then 
have 

aQ(u,v)  =  L0(v)  Vv  €  Vh, 

which  together  with  (1.12)  gives  the  error 
equation 


(1.13)  a(v,v)  >  c|||v|||2  Vv€$h, 

(1.14)  aQ(v,v)  <  cl||v||l“  VveVh, 
where. 

HI  -  (Ml2  *  ||i*Vv||2  *  ll^-VviiV, 


(1.15)  a^(u  -  U,  v)  =  0  Vv  e  Vjj. 

Recalling  the  stability  estimate  (1.14)  we 
get  using  (1.15)  with  v  =  jr^u  -  U,  where 

Tj  u  6  V,  is  the  interpolant  of  u, 

*  -  U||g  <  a^(u  -  U,  u  -  U) 


IMIo  =  (Ml2  <•  t  / y2 !  #  ■ » !  *)t 

To  prove  (1.13)  we  note  that  by  Green's 
formula 

/  /TVv  vdx  =  -  /  0-Vvv  dx 

ii  n 

-  /  div  fi  v2dx  +  /  0-n  v2ds, 

n  r 


=  ac(u  -  U,  u  -  »hu) 
+  aQ(u  -  U,  TTj^u  -  U) 


=  aQ(u  -  U,  u  -  Tj  u) 

<  ||0*/hV(u  -  U)||*P“*(u  -  7rhu)|| 
+  ||^-V(u-U)|!P^.V(u-V)|| 


which  gives 

(/LVv  +  ov,  v) 

=  ((-  ^div  0+  a)v,  v)  +  l  /  v~(i< nds, 


+(/(u-U)‘Vn|ds)1/2 

r 

(/  (u_3ri,u)2 1 ^’u I ds)1/2 

r  n 


from  which  (1.13)  and  (1.14)  directly  follow 
recalling  (1.2).  We  note  the  improved 
stability  of  the  SD-method  as  compared  to 
the  standard  Galerkin  method  satisfying 

(1.13)  and  (1.14)  with  0  =  0  and  i  =  e. 


<|||.,-U||*C||«-*(u-<rh<i)l|2 

*  C||*Vv(u-irh»)||2 

+  (/  (u-!r.u)Vn|ds)1/2 

r  11 


Recalling  (1.8)  this  proves  that 


Ml 


(1.16a)  lllu-%  <  C||h3/2D2u||, 

which  is  the  basic  a  priori  error  estimate 
for  the  SD-method  (1.12). 

We  note  that  (1.16a),  in  addition  to  the 
slightly  non-optimal  T^-estimate 

(1.16b)  |!u-U||  <  C||h3/2D2u||, 

contains  the  estimate  ||^/0*V(u-U)||  < 

C||h3/2D2u||,  that  is  assuming  for 
simplicity  that  h(x)  =  li  constant  and 
|  (i\  ~  1,  we  have  the  optimal  estimate 

(1.16c)  ||^V(u-U)||<C||hD2u|| 

for  the  convection  operator  d- V  applied  to 
error  u  -  U.  In  particular  we  have  for  the 
residua!  R(U)  -  f-/?-VU-oU  =  /?-V(u-U)  + 
o(u-U)  the  following  optimal  estimate 

(1.16d)  1|R(U)||  <  C||hD2u||. 

The  estimates  (1.16a-d)  for  the  SD-method 
should  be  compared  with  the  following 
estimates  for  the  standard  O.alerkin  method 

(l.l6e)  ||u-U||  <  C||hD2u||, 

(1-I6f)  ||R(U)||  <  C||D2u||, 

which  are  seriously  non-optimal. 

The  error  estimates  ( 1 . 16a-d)  are 
meaningful  only  if  h  is  small  enough  to 
resolve  all  features  of  u  requiring  h  to  be 
smaller  than  c  in  e.g.  outflow  layers.  In 
case  some  features  of  u  are  uot  resolved 
we  may  replace  for  instance  (1.16b)  by  a 
localized  analog  of  essentially  the  form 

(l  ies)  s  c«l>3/2D2ulllj,<n«) 

where  11'  C  Q"  C  ft  and  ii  is  smooth  in 
ft".  The  truncated  domain  ft"  is  obtained 
from  ft  by  downwind  or  crosswind 
"cut-off"  with  cut-off  distances  of  order  0 

(h)  and  r?(h3/^),  respectively.  This 
means  that  unresolved  layers  do  not 
degrade  the  error  at  the  indicated  distances 
in  upwind  or  crosswind  directions.  This  is 
contrast  to  the  standard  Galerkin  where  an 


unresolved  layer  may  degrade  the  error  in 
the  whole  domain.  Proofs  of  tire  indicated 
cut-  off  results  for  the  SD-method,  which 
are  based  on  the  improved  stability 
resulting  from  the  streamline  diffusion 
modification,  are  given  in  [JNP], 

1.7.  An  a  posteriori  error  estimate  for  the 
SD-method 

Let  us  now  prove  a  basic  a  posteriori  error 
estimate  for  the  SD-method  (1.12).  In  this 

case  the  artificial  viscosity  e  plays  a 
crucial  role,  while  we  may  take  6  —  0  and 
also  Q'  =  0  without  loosing  the  essentials 
of  the  argument.  The  discrete  problem  thus 
reads:  Find  U  6  Vj  such  that 


(1.17)  (/MU,v)  +  (cVU,Vv) 

-  /  Uv/?-nds  =  (f,v)  -  /  gv/Tnds 

r  r 


The  perturbed  continuous  problem  with 
solution  u  takes  the  following  variational 
form:  Find  u  e  H*(ft)  such  that 

(1.18)  (/i-Vu.v)  +  (fVii,Vv) 

-  /  uv/Tnds  =  (f,v)  -  /  gv/Tnds, 

r_  r_ 

Vv  e  II1  (ft). 

Let  us  now  introduce  the  following 

continuous  dual  problem:  Find  tp€  Hl(ft) 
such  that 

(1.19)  -(v,  0-Vip)  -  ((div  0)\,  ip) 

+  (eVv,  Vv?)  +  /  vip0-n  <ls 

r 


=  (v,  e)  Vv  c  Il^ft), 

ll1  (ft)  where  e  =  u  -  U  and  f+  =  {x: 

/?(x)-n(x)  >  0}.  Taking  v-e  in  (1.19) 
we  get  the  following  error  representation 
formula 


112 


l|e|r  =  -(e,  /?-M-((<iiv  jdOe,  v>) 


+  /  (U-gMyj-^-n  ds 
P 


+  (cVe,  Vy>)  +  /  Gipfi- nds 
r+ 


=(/?•  Ve,  y>)+(eVe,  V^)-/  ep/?-nds 

'  r 


«  (j£?-Vu,  v?)  +  (cVu,  fy>) 

-  /  u<^/?-n  ds  -  (/?-?U,  v?) 

r 


=  (f-/f-VU  +  div(eVlJ),  *?-*) 

-  £  *  Hr  I*-*)'15 

+  /  (U-g)(v*)0*nds 

r 


(eVU.Vu?)  +  /  U <pft'  n  ds 

r 


(f,v?)  -  /  gv»/?*n  ds  -  (/7-VU,  y») 

r 


-  (cVU,  V^)  +  /  U^*n  ds 

r 


=  (f,  </>-•!>)  +  j  (U-g)(v°"$)/>' » ds 

-(0-W,v>-*)  -  (cVU,V(v?-*)) 


so  that 


(;  .  n  • 

l.oOj 


llc||  =  (f-/y.vu,v»-+) 

+  /  (U-g)(v>-*)/f*n  ds 

r 

-  (cVU,  V(v>-*)), 


where  we  integrated  by  parts  and  used 
(1.17),  (1.18)  and  $  e  Vj  is  arbitrary. 

Integrating  by  parts  on  each  element  K  in 
the  third  term  on  the  right  hand  side  of 

(1.20) ,  we  get 

(1.21)  ||e||2  =(M-VU+div(fVU),v-0>) 

-  ft  /  <  ~  (</>-4>)ds 
K  dK  K 


where 


=  (Rj(U),  v-*) 

+  E  /  hK  09(U)(V»-«)ds. 
K  dK  K  J 


nl(u)|K^ 


(f-/?*VU  +  div(cVU))  |  K  K£Th, 

4  l «  Hg  •  VU]  on  dK , 


hKK.2(U)-  <?lJ 
r?n 


onS  C  dK  , 


■^ISr+(u‘B)^n  0,1  Sc5K> 

if  dK n r  =  0,  sc r+,  or  s c r_, 
respectively. 

We  shall  now  choose  $  =  and  use  the 

following  strong  stability  estimate  for  y  to 
estimate  i p-$  via  (1.8)  (for  a  proof  we 
refer  to  [EJ7]): 

Lemma  1.1.  Under  sufficient  regularity 

assumptions  on  ft  and  t  there  is  a 
constant  C  such  that  if  <p  satisfies  (1.19), 
then 

Ml  +  Ik'MI  +  ||div(cVv?)|| 


+  \\0-V<p  +  v?  div  ft\\  <  C||c|| 
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||tD>|!<C||o||. 

Recalling  the  error  representation  formula 

(1.21)  we  now  get 

(1.22)  ||e||2<e||iiVlrt||, 

where  R  =  R^  +  R.>,  R^  —  |R  J  and 

IU I  —  max  max  I  R0 1 . 

2  K  ScOK  S  Z 

VVe  thus  obtain  the  following  a  posteriori 
error  estimate  for  (1.12)  recalling  that  t  = 
max(C.2hR(U)/|VUl,C3h3/2): 

Theorem  1.1.  There  is  a  constant  C  such 
that  if  U  and  u  satisfy  (1.17)  and  (1  18), 
then  o  =  u  -  U  satisfies 

( 1 .23)  ||e||  <  ||E(h,U,f)||, 


whore 


K(h,U,f)  =  ChVlt 


=  Cmin(C.2*h | VU | ,  C.^'h’R). 

We  note  that  by  ( 1 .  IGd)  the  quantity 

E(h,U,f)  appears  to  be  of  order  <?( h3/“) 
in  regions  of  smoothness  of  the  exact 
solution  and  of  order  ^(1)  close  to  a 
discontinuity.  Thus,  (1.23)  appears  to  be  as 
sharp  as  possible.  For  numerical  results  for 
adaptive  algorithms  based  on  (1.23)  we 
refer  to  (E,17).  In  such  algorithms  one 
seeks,  in  an  iterative  process,  a  mesh 
T|t  -  { K }  such  that 

li(h,U)f)|KhK^TOL/VNll,VK€Th. 

where  TOI,  >  0  is  a  tolerance  given  by 
the  user  and  is  the  number  of  elements 

K  in  Tj(,  corresponding  to  equidistri- 

bution  of  the  element,  contributions  in  the 
right,  hand  side  of  (1.23). 


With  e  >  0,  we  may  add  the  condition 

(  -  t  in  the  adaptive  process, 
corres|>onding  to  resolution  of  all  details, 
see  Section  8. 


2.  SD  METHODS  POlt 
TIME-DEPENDENT  LINEAR  SCALAR 
CONVECTION  DIFFUSION. 

2.1.  A  model  problem 
We  shall  now  consider  the  SD-method  for 
the  following  time  dependent  model 
problem: 


(2.1a)n(  +  /TV u  -  div(cVu)  =  f  in  ft  *  I, 
(2.1b)  u  ~  0  on  T  h  I, 

(2.1c)  u( •  ,0)  ”  u  in  ft, 


where  0  —  /?(x,t)  is  a  given 

smooth  velocity  field,  c(x,t)  is  a  positive 
viscosity,  ft  is  a  bounded  polygonal 

domain  in  K  with  boundary  T,  1  = 

(0,T)  with  T  >  0  is  a  given  time  interval, 
and  f  =  f(x,t)  and  u^  -  Uq(x)  are  given 

data. 

2.2.  Space-time  discretization 

Lot  0  —  “o  <  1 1  <  l2  <-<  ln  <■•■  ~  f 

be  a  sequence  of  discrete  time  levels,  set. 

kn  =  ‘„*l  -‘n  !H,d 

introduce  the  space-time  strips  or  "slabs" 
Sn  =  ft  x  1^.  Let  for  n  =  1,...,  N,  T  — 

(K|  be  a  finite  element  subdivision  of  S 

into  space-time  elements  K  of  diameter 
h^  with  corresponding  mesh  function 

hn(x,t)  and  let  Vncft1(S|l)-  f(vc 

HI(Su):v  =  0  on  F  x  1^}  boa 

corresponding  finite  element  space,  and 
define 

N 

V  =  II  V.. 


{ v:  v|s  €Vn,  n  =  1 N}. 
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The  space-time  inosh  T  =  {I<}  on  !?n 

may  typically  consist  of  standard  tensor 
product  elements  K  -  k*  I  ,  where  k  is 

2 

a  triangle  or  quadrilateral  in  IR  ,  or 
"tilted1' such  elements  (see  below),  or 
tetrahedrons  K  of  height  k  .  The  mesh 

T  may  also  be  more  general  with  time 

steps  variable  in  space. 

We  notice  that  v  €  V  may  be 
discontinuous  in  time  at  the  discrete  time 
levels  t  and  to  account  for  this  fact  we 

define 

v2(0  =  lim  y(.,i  ±s). 
s-*  0 


f  n  n  n 
fv  )  =  v+  -  v_. 


Using  the  proper  generalization  of  (1.9) 
and  (1.12)  to  the  time-dependent  case,  we 
shall  now  seek  an  appioximation  IJ  6  V. 

2.3,  The  SD-mclhod  on  general 
space-time  mesh<*s. 

We  now  formulate  the  SI)- method  for  (2.1) 
as  follows:  Find  f  <e  V  such  that 

(2.2)  a(U,v)  ---  L(v)  Vv  e  V. 

where 


(v,w)  =  /  (v,w)dt, 
Jn 


ci(k“2  +  li“’2|/f|2ri  on  Sn, 

c  =  max  (t.  C2hR(U)/|VU|,  C.^h3/2), 
3 

R(U)=  Y  R.(U), 
j=l  J 

Ilj(U)  =  Ut+^VU-div(cVU)  on  K, 

H.>( U ) | K  =  max  J  |(cn  -VU]|/hK, 

Rj(U)=  l|Un]l/k„  on  S„, 

where  (in  -VUJ  denotes  the  jump  in  the 

quanlity  cn  ’VI!  across  a  side  S  of  K 

with  normal  n  —  (n  ,  n  ),  and  where  (Unj 
is  extended  io  S  as  a  constant  in  t. 

The  basic  a  priori  error  estimate  for  (2.2) 

analogous  to  (1.16a)  reads  in  the  case  r  = 

(  —  o  with  piecewise  linear  approximation 
in  x  and  t. 


a(w,v)  -  1!  a  (w.v' 

it  =0  1 


where 


<  C||h3/2l)2i 


+  Y  J  [w")v  1  dx  +  /  wVdx, 
n=l  o  il  * 

an(W’,v)  rr 

—  (w.  +  /)•  Vw,v  +  iV(v  +  />“•  V  v ) ) 


+  (tVw,Vv)n  +  (<wt.vt)n. 


C2.3)  IIMII  =  (IMIj*  /q) 


N-l  A 

+  E  ||«*(v(^-Vv)||f  (S  s 
n=0  1  '2'nr 


♦V  ll!v"]||f  (ft))1/2. 

n— 0 


L(v)  -  Y  (f,v.  A(v  t//.?v))  +  (un,v . ) 
n~()  1  11  u 


where  h(x,t)  ~  diam  K  if  (x,t)  t  K  and 
D“u  ~  1)7  .  .  ui  is  the  maximal  partial 
derivative  of  u  of  order  2  with  respect  to 
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(x,t).  Wo  note  the  presence  in  |||v|||  ol' the 
jumps  (vnj. 

3.  THE  CSD-METHOD  FOR  UNBAR 
SCALAR  CONVECTION  DIFFUSION 
PROBLEMS 

3.1.  A  model  problem 

We  now  turn  to  a  particular  variant  of  the 
SD-method  (2.2)  for  time- dependent  linear 
scalar  convection  diflusion  obtained  by 
orienting  the  mesh  approximately  along 
characteristics  locally  in  time.  As 
indicated,  wo  will  refer  to  this  method  as 
the  characteristic  SD-method  or 
CSD-method.  This  method  combines  the 
advantages  of  both  Eulerian  and 
bagrangean  techniques  without  suffering 
from  the  disadvantages  of  either  approach, 
i.e.  oscillations  or  excessive  artificial 
diffusion  in  Eulerian  methods  and  mesh 
distortion  difficulties  in  Lagrangean 
methods, 


(T-1)  u(x(  \,r),r) 

=  u,  +  •  Vu  =  u,  *■  /LVu, 

so  that  for  example  in  the  case  t  ~  0,  the 
equation  (3.1)  takes  the  simple  form 

(3-5a)  $£(x(*.r),r) 

=  f(x(\,r),r),  r  >  0, 

(3.5b)  u(x(y.0),0)  Uq(\). 

which  is  a  family  of  ordinary  differentia! 

equations  parametrized  by  \  e  (R".  This 
expresses  the  fart  that  in  Lagrangean 
coordinates  (\,r),  the  convection  part 
(iij  +  /Mru)  of  (3.1a)  takes  a  very  simple 

form. 


To  isolate  the  essential  features,  we  shall  as 
a  model  problem  consider  the  following 
pure  initial  value  problem: 

(3.ia)ut+d- vu-div(fvu)—  f(x,i.)  in  Q, 
(3.1b)  u(  •  ,0)  =  u0  in  (R 

where  Q  =  (R“*(0,T),  and  is  a 

smooth  velocity  field  with 

(3,2)  div  fi  -■  0  in  Q, 

and  f  and  Uq  are  given  functions  with 

compact  support  in  Q  =  IR  *  (0,<»)  and  IR2 
respectively  and  c  >  0  is  constant.  The 
restrictions  to  a  pure  initial  value  problem, 
a  divergence-free  velocity  field  fi  and 
constant  t  are  not  essential. 

We  recall  that  the  characteristics  of  the 
differential  operator  (^  +  /TV)  in  (3.1a) 
are  curves  x  — x(y,r)  given  by 

(3.3a)  ^  ~  $(x,r)  r  >  0, 

(3-3b)  x(\,0)  —  y. 

We  note  that. 


To  be  more  precise,  let  us  now  introduce 
the  mapping  F:  Q  -<  Q  defined  by  (x,t)  - 
F(,\,r)  =  (x(y,r),  r),  where  x(\,r) 
satisfies  (3.3).  We  recall  that  |  J  |  s  dot ,), 

whore  .1  -  satisfies  the  equation 
--^1-  =  div  fi  \  S\  =  0,  |J|(-,0)=  1  so 


that  |  .1 1  (*,;)  =  l  for  r  >  0,  which  shows 
that  F:  Q  *  Q  is  1-1  and  onto.  Defining 

now  u(\,r)  -  u(x,t)  where  (x,t)  -  F(\, r) 


and  Au(y,rl  ~  Au(x,t),  the  problem  (3.1) 
takes  the  following  torm  in  characteristic 
coordinates  (\,r), 


(3.6a)  uf  -  cAu  -  f  in  Q. 

(3.6b)  u(  •,())  =  »i0  in  tR2, 

which  is  a  variant,  of  the  standard  heat 
equation  with  a  variable  coefficient  analog 

A  of  the  usual  Laplacian. 

In  the  CSD-method  for  (3.1)  the  mapping 
F  is  now  built  in,  in  a  suitable  discrete 
form,  which  basically  will  make  the 
CSD-method  for  (3.1)  equivalent  to  the 
Discontinuous  Galerkin  method  (the 
DG-inethod)  for  the  parabolic  problem 
(3.6)  with  tensor  product  elements  in  the 
characteristic  coordinates  (.y,r).  In 


IK. 


Eulcrian  coordinate's  (x,t)  this  will 
correspond  to  using  space-time  meshes  on 
each  slab  S  approximately  oriented  along 

characteristics.  For  the  l)CJ-tncthod  for 
paraltollc  problems  wit  h  tensor  product 
space-time  elements,  a  very  precise  error 
analysis,  including  in  particular  integration 
over  large  time  with  frequent  remeshing,  is 
available  ((Jd),  (J3j).  Applying  these  results 
to  (3.G)  we  obtain  sharp  error  estimates  for 
the  CSD-method  for  (3.1),  which  signi¬ 
ficantly  improve  the  corresponding 
estimates  for  the  general  SD-iuelhod 
without,  mesh  orientation.  In  particular  it 
follows  that  integration  over  largo  time  and 
frequent  retneshing  is  possible  in  tire 
CSV)- method  without,  serious  accumulation 
of  errors  or  dissipation  of  fine  scales. 


will  be  an  approximation  of  d(x,t)  on  S  . 

Here  (x.t)  takes  the  role  (locally  on  each 
S  )  of  the  characteristic  coordinates  ( \,r) 

above.  We  note  that  with  I  the  identity. 


dy. 


=  1 


l 

M) 


(t  -  tny 


H  1 


1+r 


v  B.(X), 


which  proves  that  the  mapping  1*  :  S  - 
Sn  is  one-to-one  and  onto  if 


3.2.  The  CSD-method 
As  above,  let  0  —  t ^  <  t }  <... 

<  t  <  t^+(  -  T  a  sequence  of  discrete 

time  levels  with  associated  time  intervals 
time k„  i„ 

and  slabs  S  =  IR"  *  1  and  let  for  each  n 

k  I  1 1 


a  mm\  uviuvm 


\V 


r  ?  1  ■  ivf 

V.  M  /  >/• 


piecewise  linear  functions  on  a  mesh 

T  -  (x)  on  IR"  with  elements  />  and 

mesh  size  h(1  ~  h  (x)  he  given,  bet  us 

now  for  a  given  <|  >  0  for  each  n 
introduce  a  mapping  l.:t):  -  S  defined 

by 


(3.7)  (x.t)  =  ln(x.t) 


r+T 


Bjfs).  o. 


(x.t)  c  S  , 

where  the  lij  e  VV ^  are  functions  to  be 
defined  so  that 


(3.8)  H(x.t)  n  »(x,t) 


=  t  (t  -  t  J  B;(X), 

j— 0  n  J 


with  c  small  enough.  This  puts  a  mild 
condition  on  the  time  step  I,  if  ;i  is 

smooth.  In  the  case  of  a  non -smooth  /i, 
wo  may  in  order  to  avoid  restrictions  on  the 
time  step  k  ,  define  the  approximation  i) 

by  first  smoothing  .h  Next  we  note  that 

x  -  x(x,t)  defined  by  (3.7)  satisfies 

(3.10a)  —  -  1 1( x . t )  t  e  1  , 

dt  11 


(3.10b)  x(x,t())  -  x. 


This  means  that  the  carves  x(x,t)  defined 
l;»  (3  ’  o)  will  be  approximations  to  the 

exact  characteristics  x(x,t)  defined  by 
(3,3),  if  H(\,t )  approximates  /»\x,t).  In 
the  simple  basic  case  with  q  -  0,  we  have 

(3.11)  B'x,t)  =  B(x,t)  =  B0(Jt), 

where  l>^  C-  will  be  an  interpolaut  of 

f  -  IK -.»■„) • 

Continuing  the  notation  used  above,  we 
adopt,  the  convention  of  as  ociating  a 


•  I  f 


function  v(x.t)  on  Sft  with  a  given 
function  v(x,t)  on  S(i>  nml  vice  versa,  by 

writing  v(x,t)  =  v(X,t)  where  (x,t)  - 

K  (x  t)  We  now  introduce  the  following 
n'* 

space  of  functions  v(x,t)  with  the 
corresponding  v(x,t)  being  tensor-product 
(piecewise)  polynomials  in  (x,t): 

V  =  (v  e  nHsn);  = 

-  2  (t  -  t  v’U:.  where  U-€  Wu). 
j— 0  J 

in  which  an  approximate  solution  V  of 
(3.1 )  will  be  sought  on  each  S^.  Note  that 

(K:  K  =  Kn(KKg,  k«Tu)  gives  a 

subdivision  of  S()  into  oriented  space-time 

elements  ("titled  curved  prisms")  K  = 

I'  ( >.-  *  |  ).  kg’I'  corresponding  to  the 

nv  tr  n  .... 
subdivision  k  €  Tn)  ot  Sn  into 

straight  prisms  k  *  1  . 

We  now  define  the  (’SI) -met hod  as  follows: 

For  n  =0,1 . find  U  s  U|s  e  V  sndt 

‘  n 

that 

(3.12)  /  (V  i  ,}-\V) 

S 

n 

(v  +  ^(v  +  ,FVv))dx<U 


+  /  tVU-Vvdxdt*  /  H»ll)v’Mx 


S 


r 


=  j  f(v  .  A(v  +  ./•  Vv)dxdt 

S„ 


Vv  C  Vn, 


where  l*_  =  Uq. 

^Cj(kn2  ’  0,1  Sn’ 


c  -  max(r.c:.)hH(U)/|'?ti|, 


on  SM. 


3 

mu)-  v,  1UU). 

j=i  J 


u  (U)=|Ut+/f  Vll-f-div(fVU)|  on  K, 

Ib,(<j)|K  ~  max  ^  |[fn  -Vl))|/nK, 

-  dK  - 


H;t(l') 


on  S,.t 


(U»)  i  s  now  extended  to  S()  to  he 
constant  along  the  approximate 
characteristics  (x,t)  =  l'n(x.t),  t.  c-  1^. 


bet  ns  now  reformulate  (3.12)  in 

characteristic  coordinates  (x.t).  We  shall 
i  Itott  (ICO  lol Inu-i n»»  notation  whoro 

V4IV-M  tO’'  '  . . . o  ■■■ 

(x.t)  -  l'n(x,t): 


</x 


where  A  [  =  (A  V  with  T  iiere 
denoting  the  trans|)Ose.  We  note  that 

j  (X  t  )  -  l  for  x  C  lie  and  recall  that 
tr  ’  n 

with  |Jn|  -  dot  ,](l 


IWI  $•!•', ,1  =  <>w  tM-1,,1.  t«V 

from  which  follows  that 

(3.11)  l|.l„l!  <  . 

with  ||  -||  the  matrix  norm  induced  by  the 

bud i dean  norm  in  1R"  if  VB  is  bounded 
on  S  and  kn  is  small  enough.  We 

further  note  that 
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+  fi-Vv  =  vt  +  B-Vv  +  (/?-B)-Vv 

=  ~  v(x(x,t),  t)  +  (ft-  B) 
dt 

•J~*  V-v  =  vr  +  a- V-v  =  vr  +  o-Vv, 

I!  X  u  X  l 


where  V-v  =  Vv.  Changing  to  (x,t) 
coordinates,  (3.10)  now  takes  the  form: 


For  n  =  0,1,2,...N  Find  U 


U|S.eV«’ 


{v:  v  6  V  }  such  that 


/  (Ur+a-VU)(v+£(vr+a-Vv))  I J  | dxdt 
^  l  i>  /  n 

n 


=  /  f(v+d(Vj  +  A"Vv))|  Jn|dxdt,  Vv  €  Vn, 
Sn 


with  t  and  6  defined  as  in  (3.12). 
Clearly,  (3.15)  corresponds  to  the 
DG -method  with  tensor  product 
space-time  elements  for  the  following 
variant  of  (3.6): 

(3.6)  iij;  +  »•  Vii  -  div(eVu)  —  f, 


where  a  will  be  small  if  /?  is  smooth.  If 

|o|  <  Ct,  then  (3.15)  has  full  "parabolic 
nature"  and  the  sharp  error  analysis  from 
[EJ3]  for  the  DG-rnethod  applies,  whereas 

for  o=^(l)  the  general  SD-analysis  may 

be  used.  Now,  |  «|  <  C|  ft  -  B| ,  so  the  size 

of  h  is  directly  connected  to  the  quality  of 
the  velocity  approximation  B  of  0 
defined  by  (3.11).  By  standard  interpola¬ 
tion  error  estimates  with  B  a  suitable 
interpolation  approximation  of  [),  we  have 


(3.17)  ||b-B||L(Q) 

oov  ^ 

<  Cdlk"*1  DfV  ||L  (Q) 

oov 

+ 

where  h  =  h„  and  k  =  k  on  S„. 

n  n  n 

We  now  state  error  estimates  for  (3.15) 

under  various  assumptions  on  the  size  of  a 

as  compared  to  e.  We  recall  that  e  will 

satisfy  C,h^/2  <  t  <  C0h  if  e  is  small. 

As  in  [EJ7-8]  we  shall  compare  U  with 

the  solution  u  of  (3.1)  with  e  replaced  by 

c.  The  error  u-u  changing  e  to  e  in 
the  continuous  problem  may  be  estimated 
separately,  see  [EJ7-8].  For  simplicity  of 

notation  we  identify  u  and  u  below. 


Case  fi):  |d|  <C|c|  in  Sn,  Vn. 

As  indicated,  in  this  case  we  may  directly 
use  the  sharp  error  estimates  for  the 
DG -method  for  parabolic  problems  given  in 
[EJ3]  under  the  (non-restrictive) 
assumption  that 

(3.18)  kn  >  c  max  h“(x)/e. 
x 

We  then  obta!n  the  following  a  priori  error 

estimate  assuming  e  and  a/c  are 
sufficiently  smooth:  There  is  a  constant  C 

such  that  for  t^  <  (max  c)  *  we  have 

with  I  =  (0,  tN) 


(3.19)  !Iu'UHlJ0,T;L2)  - 


< 


CL(||k^+1D^+1Q||L  fQT. 

t  ocAu’ 1  ’ 
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where  L  =  max  (log  -4^  +  1)*^,  and 
n<N  n 

IM!T  rn  ti  i =  SUP  HOII 
JV0,1,1V  0<t<T  L2(Rz) 

This  is  a  fully  optimal  error  estimate  with 
the  constant  C  independent  of  ♦  ^  and 

the  time  step  k  coupled  with  derivatives 

of  u  with  respect  to  the  characteristic 

coordinate  t.  Improved  variants  with  q+1 
replaced  by  2q+l  mav  also  be  derived,  see 
[EJ3].  Further,  the  following  a  posteriori 
analog  of  (3.19)  was  derived  in  [EJ3]  (for 
simplicity  stated  in  the  case  f  =  0): 

(3.20)  ||u-U||L  (o  T;L  ) 

<  CL( II lul IILoo(0,T;L2) 

+  Hh“DhUllL  (O.T:L2) 

+  Hh“lU]/k|| J  (0,T,I  ))’ 

where  the  *  indicates  that  the  integrand 
should  replaced  by  zero  on  I  if  W  j  C 

W  and  [U]  is  extended  to  Sp  as 

constant  in  time.  We  note  the  close 
similarity  with  (3.19). 

The  case  |«|<Ce  considered  typically 
would  be  relevant  with  q  =  0  if  D  3  is 

2 

small  or  with  q=l  if  k  <  Cc,  assuming 

now  bounded,  i.e.,  giving  the  time 
dr" 

step  restriction  k  <  Ch'>^  if  e  =  Ch^2. 
In  both  cases  this  indicates  the  possibility 
of  choosing  the  time  step  kp  larger  than 

the  one  corresponding  to  CFL  >  1. 

Case  (ii):  |  a|  <Ct’. 

In  this  case  we  state  the  following  a 
posteriori  derived  in  [EJ8]: 


»“-u»L2(Q) 

<  C||(k„  *  rVVllL2,Q), 

with  C  independent  of  T.  A  typical  case 
could  now  be  as  follows:  q  =  0, 

c  =  r7(h^/2),  kn  =  Ch^  ,  indicating 

again  the  possiblity  of  taking  the  time  steps 
larger  than  the  space  steps  corresponding  to 
CFL  >  1. 

Case  fin):  |  a|  >  C  e^. 

In  this  case  we  may  use  the  general 
estimate  (2.3)  for  the  SD-method  (2.2) 
analogous  to  (1.16a). 

3.3.  Exact  transport  and  projection  ETP 

In  the  case  (3  =  constant,  q  =  0,  B  =  /?, 
in  which  case  we  may  take  6  =  0,  the  CSD 
method  (3.12)  takes  the  form  (assuming 
also  f  0  for  simplicity):  Find 

U  €  V  =  {v:  v(x,t)  =  w(x)  with  x  = 
x+(t-tn)A  w  e  Wp}  such  that 


(3.21)  /  (lL  +  /LVU)vdxdt  +  /  eVU-Vvdxdt 

q  1  s 

\  n 


+  /  [l)n]v"  dx  =  0,  Vv  6  V  . 

2+  II 


Since  v. +/?>Vv  =  0  if  veV  ,  we  can 

l  II 

write  (3.21)  as  follows:  Find  ll”  e  Wp 
such  that 

(3.22)  /  U”v  dx  +  k  /  eVU"-Vvdx 

2+  n  q  + 

ID" 


=  J  (x-kl/)v(x)dx,  Vvewn, 

R2 

where  we  used  the  fact  that  by  the 
definition  of  Vn_j,  U"(x)  = 
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U””*(x-k  \0).  With  e  =  0  this  means 

that 

(3.23)  u;  =  PnT„U;-‘, 

where  P  :  L..((R-)  -*  V  is  the 

I^- projection  defined  by  (Pnw,v)  ■  w,v) 

Vv  6  V  ,  and  T  is  the  translation 
n’  n 

defined  bv 

(Tnv)(x)  =  v(x  -  kn_^). 

Clearly,  (3.22)  corresponds  to  a  method  for 
the  problem  (3.1)  with  c  =  0  based  on 
"exact  transport  +  L^-projection".  With 

the  proper  definition  of  i  in  the  present 
case,  i.e., 

i  =  c2h2|!u"]|/kn, 

the  OSD-method  (3.22)  corresponds  to  a 
method  of  again  the  form  (3.23)  with  now  a 

modified  L^-projection  P(1:  H*((R2)  -*  Vn 

defined  by 

(3.21)  (Pnw,v)  +  (tknVPnw,Vv) 

=  (w,v)  Vv  €  Vn, 

where  ckn  =  Cgh2 1 IMv  -  w|.  In  the 

presence  of  discontinuities  the  modified 

projection  Pfi  shows  improved 

performance  as  compared  to  the  standard 
I^-projection  Pn,  with  monotone 

resolution  of  discontinuities  where  Pfi 

gives  mild  oscillations. 

We  have  now  seen  that  the  CSD-method  in 
the  case  0  constant  e  =  f  =  0,  and  q  =  0 
reduces  to  a  method  of  the  form  "exact 
transport  +  projection",  or  ETP  for  short, 
with  a  built-in  modified  I^- projection  with 

good  stability  properties.  More  generally, 
the  CSD-method  will  reduce  to  a  method 
of  this  form  (in  the  case  e  =  0)  if  the 


discrete  velocity  B  in  the  CSD-method 
defined  by  (3.8)  is  equal  to  the  exact 
velocity  0.  We  recall  that  the  idea  of  ETP 
underlines  the  Godunov  method  and 
generalizations  thereof  (VanLeer,  PPM), 
and  can  probably  be  viewed,  together  with 
the  idea  of  centered  difference  approxima¬ 
tions  and  artificial  viscosity,  as  the  bearing 
principle  in  classical  CFD  not  including 
finite  element  methods. 

Although  thus  ETP  has  been  fairly 
successful  as  a  principle  for  the  generation 
of  schemes  in  CFD,  ETP  does  not  offer  a 
full  discretization  procedure  since  it 
requires  the  "exact  transport"  step  to  be 
performed  form  one  discrete  time  level  to 
the  next,  which  in  general  is  highly 
non-trivial  (e.g.  requiring  the  solution  of 
Riemann  problems). 

Now,  our  point  is  that  the  CSD-method, 
which  coincides  with  ETP  in  simple  cases, 
and  which  gives  a  full  discretization  for 
general  problems,  may  lie  viewed  as  the 
natural  generalization  of  ETP.  The 
following  advantages  are  obtained  this  way: 

(i)  Fully  discrete  schemes  for  general 
problems  are  obtained;  it  is  no  longer 
necessary  to  solve  the  given  equations 
exactly  between  discrete  time  levels. 

(ii)  The  analysis  of  the  CSD-method  is 
more  precise  than  the  classical  analysis  of 
ETP.  For  example  the  CSD-analysis  shows 

2 

accuracy  <?(h  /VK)  with  piecewise  linears 
in  space  and  q  =  0  when  k  is  the  time 
step,  while  the  ETP-analysis  gives 
o 

0{h  /k).  Further,  with  the  sharp 
CSD-analysis  related  to  (3.19)  it  can  be 
proved  that  in  the  CSD-method  a 
discontinuity  may  be  propagated  over  long 
time  intervals  with  little  smearing,  see  m 

4.  THE  SD  AND  OSD-METHODS  FOR 
THE  INCOMPRESSIBLE  NAVIER- 
STOKES  EQUATIONS 

4.1.  Formulation  of  the  SD  and 
CSD-methods 

In  this  section  we  present  the  SD  and 
CSD-methods  for  the  incompressible 

Navier-Stokes  equations  in  (d  =  2,3): 
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Find  the  velocity  u  --  (u.)d_p  and  t.he 
pressure  p  such  that 

(4.1a)  ut+(u*V)u+Vp  -  cAu  =  f  in  Q, 
(4.1b)  u  —  0  on  T  x  (0,T) 

(4.1c)  u(',0)=u0, 

where  Q  =  il*  (0,T),  and  SI  is  a 

bounded  domain  in  Rd,  e  is  a  positive 
(small)  constant  and  f  and  Uq  are  given 

data. 

Let  now  as  above  {tn}  be  a  sequence  of 

discrete  time  steps  and  let  for  each  n, 

WR  c  Hq(O)  be  a  finite  element  space 

consisting  of  Diecewise  linear  functions  on  a 
mesh  Tr  =  {«}  of  mesh  size  hn>  For  a 

given  velocity  field  U  on  S  =  ft  x  I  let 

us  now  introduce  the  particle  paths  x(x,t) 
defined  by 

(4.2a)  ~  —  U(x,t)  tel 

dt 

(4.2b)  x(x,  tn)  =  x,  x  €  ft, 


(4.3c)  V'J  =  (v:v(vj), 

Q"  =  {q:  4  €  q'J). 

We  can  now  formulate  the  CSD-method, 
for  simplicity  without  shock-capturing 

(c  =  c),  as  follows:  For  n  =  0,1,2,...,  N, 

find  (U,P)  =  (U,P)|j  e  vJJ  x  wjj,  such 
n 

that 

(4.4)  (Ut  +  (U.V)U,v)n-(P,divv)n 
+  (q,  div  U)n+  e(VU,  Vv)n 
+  «j(Ut  +  (U-V)U  +  VP  -  eAU, 
v.  +  (U-V)v  +  Vq  -  eAv) 

+  (ftj  divU,  div  v)n  +  (^U,v)n 
=  (f,v  +  ^(vt  +  (U.V)v 

+  VP  -  eAU))n  V(v,q)eV^QU  , 

where 


and  the  corresponding  mapping  F^:  Sn  -* 

Sn  defined  by  (x,t)  =  F^(x,t)  where  x 

=  x(x,t)  satisfies  (4.2).  We  next  introduce 
for  a  given  q  >  0,  the  spaces 

(4.3a)  V^  =  {v6H1(Sn)d:V(X,t") 


6Z  =  «j(div  U)  = 


0  if  div  U  <  K 
div  U  if  div  U  >  K, 


=  2  (t-t11)iUj(x),  Uj  £  (WnJd), 
(4.3b)  Q"  =  (q  €  n‘(Sn):  q(x,t) 


and  as  above 

(v,w)  =  /  (v,w)dt,  (v.w)  =  /  v-wdx, 

in  a 

(Vv,Vw)n  =  /  (Vv.Vw)dt, 


d 

(Vv,Vw)  =  £  /  (Vv.  -Vw.  dx 
i=l  fl  1  1 


together  with  their  analogs  in 
(x,t) -coordinates: 
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Notice  that  as  defined  the  CSD-method 
(4.4)  involves  an  additional  non-linearity 

since  depends  on  U.  By  standard 

fixed  point  arguments  it  can  be  proved  that 
(4.3)  has  at  least  one  solution  (U,P). 
Normally  we  expect  to  be  able  to  find  such 
a  solution  by  computing  a  sequence  of 

solutions  (ij(k),  pOO)  by  seeking  (U^, 
(k-1)  _,r(k-l)  ' 


p(k))  e  v||V  x  Q„  •  Depending 

on  the  choice  of  initial  approximation 

(U^,  P^)  and  the  number  of  iterations 
we  may  get  different  variants  of  the 

CSD-method:  For  example  with  =  0 
and  k  =  1  we  get  a  standard  SD-method 
with  non-oriented  tensor-product  elements 
in  (x,t).  In  a  typical  implementation  of  the 

CSD-method  we  would  let  be 
obtained  from  the  previous  time  step  and 
iterate  once  (k  =  1). 


For  simplicity,  let  us  consider  the  method 
(1.3)  with  6-  =  0,  j  —  1,2,3,  which  does  not 

essentially  affect  the  details  of  the 
argument  to  follow.  Let  us  introduce  the 
following  linearized  dual  problem:  Find 

6  L2(I;  [II J(Q)]d  x  L2(Q))  =  W  such 

that  in  Q 

(4.5a)  -y>t  -  (u-V)<p+VU-vH-V0-eAy?=  e 
(4.5b)  div  y)=0 


(4.5c) 

div  if 

(4.5d) 

where  e  =  u  -  U.  Multiplying  (4.4)  by  e 
and  integrating  over  1  together  with 

integration  by  parts  gives 


(4.6)  Nlj^, 


This  being  said,  let  us  now  consider  the 
CSD-method  in  the  full  non-linear 
formulation  (4.4).  Wc  note  that  with  (4.2) 
satisfied,  we  have 

—  5  —  U(x(x,t),t)  =  U,  +  U-VU, 
d't  di  1 

which  shows  that  in  CSD-method  in  the 

local  coordinates  (x,t)  on  each  slab  Sn, 

the  convection  term  takes  a  very  simple 
form.  In  particular,  we  have  that  when 

written  in  (x,t)-coordinates,  the  discrete 
equations  (4.3)  correspond  to  a  modified 
Stokes  problem. 

4.2  An  a  posteriori  error  estimate  for  the 
CSD-method 


N 

=  E  {(-  ~  (u-V)v?+  VlJ-p,  e) 

n=0 

♦  M  e)n  +  (cVv,  Ve)n} 

N 

=  E  {(V’.e.)  +(u-Ve,ip)  +(VU-^,e) 

n=0 

-  (0, div  e)n  +  (cV<^,Ve)n-  (p-P,div  y>)n) 

+  E  /  <[Un|, /)dx 
n=0  Q 

N 

=  51  {(u.  +  u-Vu+Vp,  ip)  +(eVu,  Vp) 
n=0  1  n  n 


Let  us  now  give  a.  (formal)  proof  of  an  a 
posteriori  error  estimate  for  the 
CSD-method.  The  proof  is  based  on  a 
strong  stability  estimate  for  a  linearized 
dual  problem,  which  appears  to  be 
fundamental  but  seems  difficult  to  establish 
by  analytical  techniques  in  interesting 
case  .  However,  there  is  a  clear  possibility 
of  testing  this  stability  condition 
numerically. 


-  (Uj,  +  U-VU  +  VP,  v?)n  +  (eVU,  Vy?)n 
N 

+  (0,  div  U)  }  +  E  /([Un],/)dx 
"  n=0  ft 

N 

=  -  E  {(Ut  +  U-VU  +  7P-f,  #>-*), 
n=0  1  11 

+  («vu,  Wv-*))n  *  (div  U,  ^-©)n}n 
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+  £  /  ([Un],  /-*n)dxt 

n=0  fl 

where  ($,  0)L  sV^k'Q^  will  be 
5n  n  n 
chosen  to  interpolate  (y>,0). 

We  shall  now  assume  the  following  stability 
estimate  for  the  linear  dual  problem  (4.5): 
There  is  a  constant  C  such  that 

(4.7)  ||*t  +  (u*VM|L  (Q) 


4  IIV^-cA^IIl^q)  + 

+  ^ill2(Q)-Clle|lL2(Q)- 

Without  the  term  VU  ■  tp  present  this 
estimate  follows  by  multiplying  (4.4a)  by 
(y>t  +  (u-V)^}  and  integrating  by  parts 

using  the  fact,  that  div  u  —  div  y?  —  0. 

Note  that  by  elliptic  regularity  for  the 
Stokes  equations  it  follows  that 

(4.0  liwil^n,  *  IkD^ll^n) 

<C||^-^|L2(11). 

Combining  (4.5)-(4.7)  and  recalling  (1.8) 
we  obtain  the  following  (formal)  a 
posteriori  error  estimate 

(4.9)  Hell L.^(Q)  -  C(llh“f  1r*I,2(Q) 

♦  l’kt  *Rlll,  (Q)  *  ll<«v  UIIL.,(Q)]- 


where 


It  =  |Ut  +  U-VU  +  VP  -  f|  +  , 


and  for  simplicity  the  usual  jump  terms 
involving  i  ^ ,  have  been  omitted. 
Further,  the  viscosity  coefficient  c  in  (4.9) 


should  be  replaced  by  an  artificial  viscosity 

e  defined  as  above.  With  suitable  choice  of 

c,  the  estimate  (4.9)  appears  to  give  a 
reasonably  efficient,  adaptive  algorithm. 

The  stability  properties  of  the  dual  problem 
(4.5)  obviously  play  a  crucial  role.  In 
general  the  stability  of  (4.5)  can  only  be 
evaluated  computationally.  We  have  here 
very  briefly  scratched  the  surface  of  a  topic 
we  hope  to  develop  further  in  the  future: 
Adaptive  (C)SD-methods  for  incompres¬ 
sible  flow,  with  a  very  large  area  of 
application. 

4.3  The  CSD-method  for  free  boundary 
flow 

The  CSD-method  is  ideally  suited  to 
handle  flow  problem  with  free  boundaries 
or  moving  boundaries  with  prescribed 
motion:  Just  let  the  nodes  on  the  boundary 
move  according  to  (4.2)  with  U  a 
computed  or  prescribed  velocity.  To  be 
more  precise,  let  us  present  the 
CSD-method  for  the  Navier-Stokes 
equations  with  free  boundary  occupying  the 
volume  D(t)  at  time  t  6  (0,T):  Find 

(u,p)  such  that  for  tel, 

(4.10a)ut+(u*V)u+Vp-  div  <7=f  inft(t), 

(4.10b)  div  u=0  infl(t), 

(4.10c)  <r*n=0  on  T(t), 

(4.10d)  u(  •  ,0)=Uq  in  ft(0), 

where  a—  {<7.j}  is  the  stress  tensor 
defined  by 

"ij  -  -  ptj  *  2'Ki/u>. 

i  r  &1;  (to;  3 


and  <7-n  =  (S  cr^rij)  is  the  stress  on  the 

boundary  T(t)  of  ft(t)  with  outward 
normal  n.  Here  f!(t)  =  {x(x,t):  x  e  ft(0)} 
where  x(x,t)  satisfies 


(4.11a) 

(4.11b) 


',iY 

=  u(x,t),  t  >  0. 

x(x,0)  =  x- 


i 
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Note  that  (4.10c)  states  that  the  (total) 
stress  on  the  free  boundary  is  zero. 

Under  sufficient  regularity  assumptions  it 
is  known  that  (4.10)  admits  a  unique 
solution  if  T  is  small  enough. 

To  define  the  CSD-method  for  (4.109)  let 
{tn}  be  a  squence  of  discrete  time  levels, 

suppose  the  approximation  ftfi  of  ft(t  ) 

is  given,  let  W  be  the  space  of 

continuous  piecewise  linears  on  a 

triangulation  T  of  Jl  and  define 

and  by  (4.3).  Note  that  in  this  case 
the  functions  in  W  are  not  restricted  to 
be  zero  on  the  boundary  of  ft  .  Note 
further  that  in  this  case  the  functions  in 
and  Q  =  ^  are  defined  on  §n  = 

pU(S„)  with  S„  =  V  V  The 

CSD-method  can  now  be  formulated  as 
follows  ([Ha 3]):  Find 

(U,P)eV^xQ^  such  that 
(4.12)  (Ut*(U-VU)U,v)n 
-  (P,divv)n  +  (q,div  U)n+  2/t(e(U),  c(v))u 
+  ^(llt  +  (U-V)U  +  VP,  vt  +  (U-V)v  +  Vq)n 
+  (A^div  U,  div  v)n  +  (&jU,v)n  + 
^3<(2)it(U)  -  pi) ’ll,  (2/<t(v)-ql)’ii>n 
=  (f,v+«1(vt  +  (U-V)v  +  Vq))ll 

V(v,q)  €  «  Qj;, 

where  j  -  1,2,3,  are  defined  as  above, 
is  a  positive  constant  independent  of  h, 

and 


(v,w)  —  J  J  V’wdxdt 

n  I  ft  (t) 
n  nv  ' 


=  /  vwdxdt, 

<v>w>„  -  / .  /  v-wdsdt, 

n 

where  ftn(t)  =  (x(*,t):  x  G  ^n}  with  x 
satisfying  (4.2)  for  \  €  ft  .  We  note  that 
boundary  condition  (4.10c)  is  enforced 

it 

weakly  in  (4.12)  since  the  velocities  in  V 

are  not  restrained  on  the  boundary  of  S  = 

F^(S  ).  Further,  we  note  that  the 
•S^-term  gives  least  squares  control  of  the 
discrete  boundary  stress. 

In  Section  8,  we  present  some  numerical 
results  from  [Ha3]  for  the  method  (4.12) 
applied  to  non  -  stationary  fountain  flow 

using  q  =  0,  k  =  1  and  with  U^lg 

l3n 

given  by  U I Q 

•Vl 

5.  SD-METHODS  FOR 
COMPRESSIBLE  FLOW 

5.1  Formulation  of  the  Sl)-method 

In  this  section  we  present  the  SD-method 
for  the  compressible  Euler  equations  for  a 
o 

perfect  gas  in  [R  : 

2  2 
(5.1a)  u.  +  L'  f.  (u)  =0  x  e  r,  t  >  0, 

1  i=l  1  xi 

(5.1b)  u(x,0)=Uq(x)  x  €  R2, 

wliere 


1 

0 

W1 

,  fj  =  WjU  +  p 

*li 

w2 

^2i 

e 

Lwi  J 

Here  p  is  the  density,  w  =  (wj,W2)  is 
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the  velocity,  e  is  the  total  energy  density, 
P  =  (7-1)0*  -  p(wj  +  w*2)/2)  is  the 
pressure,  ^  the  Kronecker  delta,  and 

7  >  1  is  a  constant.  The  conservation  law 

(5.1)  can  also  be  written  in  the  form 

2 

(5.2)  u.  +  S  A-(u)u  =  0, 

1  i=l  1  xi 


df. 

where  Aj  =  is  the  Jacobian  of  fj(u). 
Let  now  (S^)  be  the  usual  sequence  of 

space-time  slabs  S}1  —  IR2*  I  ,  let  for  each 


n  V  c  H*(S  )4  be  a  finite  element  space 
and  set  V  =  II  V  .  The  general 


n>0 


SD-method  for  (5.1)  can  now  be 
formulated  as  follows:  Find  II  (  V  such 
that  U  e  II  |  Q  e  V  satisfies: 

5n  11 


(5.3) 


d't  * s  AiO>>ux., 


v  ♦  <(vt  *  i)  a'((U)vx  ))n  ,  vj) 


product  elements  K  =  r  *  I  (or  "tilted" 

such  elements)  with  the  basis  functions 
continuous  in  x  and  discontinuous  in  time, 
see  Section  5.3  below. 


Remark  5.1.  To  see  that  square  root  in 

(5.4)  is  well  defined  we  recall  ([Hi])  that 
there  is  a  positive  definite  matrix  AQ  = 

Aq(U)  such  that  Aj  =  AjAq  is  symmetric, 
i  =  1,2,.  Thus 


A“*  A  A  ' 
A0  Ai  0 


A  a-* 
**i  A0 


is  symmetric,  so  the  similarity  transform 
induced  by  Aq  transforms  the  matrix  M 

=  (k”2  1  +  h  2  E  Ap  to  an  obviously 

positive  definite  symmetric  matrix.  It 
follows  that  M  has  positive  eigenvalues 
and  a  full  set  of  eigenvectors  which  shows 

that  M~  2  can  be  computed.  In  [Ha4] 
explicit  formulas  for  the  eigenvalues  and 
eigenvectors  of  M  are  given.  We  recall 

that  A0  =  where  //( u)  = 

=  -  c p  log(pp”P  is  the  entropy.  H 


+  (cVU,Vv)il+(6Ut,vt)n=0  Vv€Vn, 
where  T  again  denotes  transpose  and 

(5.4)  gT  =  S(U)T 

=  2<kn2'  +"»2  ,1,  on  S„, 

(5.5)  ;=e(l')  =  C2l,14g(MljL, 
R(U)=  |Ut.SA|(U)U  | 

.  ||U"]|/kn  on  S„. 

The  simplest  instance  of  the  SD-method 

(5.3)  is  obtained  with  a  Pj  «  Pq- 

approximation  in  space-time  on  tensor 


5.2.  Entropy  consistency 

To  prove  convergence  of  any  numerical 
method  for  the  Euler  equations  (5.1)  is  for 
the  moment  impossible,  since  existence  of 
solutions  of  the  Euler  equations  has  not 
been  proved  mathematically.  For  the 
SD  -method  the  following  weaker  result  is 
possible  to  prove  (with  polynomial 
approximation  in  space-time  of  any  order): 
Limits  of  SD-solutions  will  satisfy  any 
entropy  condition  for  (5.1)((JSzIIJ,  [Szl ]). 

A  corresponding  result  in  the  same 
generality  for  finite  difference/ volume 
methods  is  known  only  for  first  order 
approximations. 

5.3.  An  explicit  form  of  the  SD-method 

The  SD-method  (5.3)  leads  to  a  system  of 
non-linear  equations  to  solve  for  each  slab 
Sn-  If  the  time  step  is  sufficiently  small 

(corresponding  to  CFL  <  1/2  say),  we 
expect  to  lie  able  to  solve  this  system  with 
few  Newton-like  iterations.  We  shall  now 
consider  the  simplest  case  of  Pj  *  P^- 

approximation  in  (x,t)  on  tensor  product 
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elements  r  *  Ifi  (or  more  generally 

"tilted"  such  elements),  continuous  in  x 

and  discontinuous  in  t.  Writing  Un  = 

U I Q  the  SD-method  (5.3)  can  in  this  case 
sn 

be  formulated  as  follows:  Find  Un  6  W 
such  that 

(5.6)  (Un-Up'\v) 

+  (SA.(Un)U"  ^A.(Un)Tv  ))k 

j  1  Xj  i  1  11 


I -term  on  the  left  hand  side  of  (5.6)  in  an 
iterative  method,  and  using  e.g.  the 
conjugate  gradient,  method,  at  each 
iterative  step. 

In  Section  8  we  present  some  numerical 
results  for  the  explicit  method  (5.7)  applied 
to  a  standard  test  probelm:  Mach  3  flow  in 
a  channel  with  a  step  up.  We  further  give 
some  results  for  (5.6)  extended  to  the 
compressible  Navier-Stokes  equations  for 
flow  over  a  flat  plate.  In  both  cases  the 
adaptive  mesh  control  is  based  on  (formal) 
a  posteriori  error  estimates  of  the  form 
(0.10). 


+  (eVUn,  Vv)kn  =  0  Vv  €  Wn> 

where  Wn  C  H*(R2)^  consists  of 
continuous  piecewise  linears  and  the 
corresponding  V()  c  is  defined  by 

V„-{v6H1(S„)4:v(..l)  =  w  for  t €  1,,. 
where  weWn). 

To  compute  Un  from  (5.6),  we  may 
iterate  in  various  ways:  The  simplest 
poss:ble  variant  is  obtained  by  lumping  the 
mass  matrix  related  t.o  the  inner  product 
(•,•)  in  the  first  term,  and  changing  the 
index  n  to  n-1  in  ail  terms  except  the 
first,  which  gives  a  fully  explicit  scheme  of 
the  form  ([Ha  4]) 

(5.7)  U"='-;lil"l-k„(Sli(U"‘1)x.,»>j) 

-k  (EAjCJ"-')!!  ffiA  (Ul'-1)Tv,j  ) 

i  1  xi  i  1  J'xi 


6.  The  DG-mcthod  for  conservation  laws 

In  this  section  we  present  the 
Discontinuous  Galerkin  method 
(DG -method)  which  is  a  variant  of  the 
SD-inethod  with  discontinuous  approxi¬ 
mation  in  space  as  well  as  in  time.  The 
DG -method  may  be  viewed  as  a  gene¬ 
ralized  Finite  Volume  method.  The 
DG-metl;od  is  fully  analogous  to  the 
SI)-method  with  the  basic  two  modifi¬ 
cations  (0.1)  and  (0.2)  and  in  addition 
certain  jump  terms  related  to  inter-element 
discontinuities. 

Let  as  bovc  {t(i}  be  a  sequence  of  discrete 

time  levels,  In  =  (tn,  tu+]),  Sn  -  1R2x  Ijt, 

and  let  T  —  {K}  be  a  finite  element 

triangulation  of  S  into  space-time 

elements  K  of  diameter  h^.  T’ypically 

the  elements  K  may  be  prisms  k  x  I 

with  k  a  triangle  or  "tilted"  such  prisms, 
or  tetrahedrons.  Define  for  q  >  0 

VnHv€L2(Sn):v|K^q(K),K6Tn}, 


-  kn(^VUn_1,  Vv’j), 


V-  II  V, 
n>0 


where  UK  =  E  U1.1  v?:(x)  with  {v?.}  the 

j  J  J  J 

standard  -basis  functions  of  W[(.  The 

resulting  method  may  be  viewed  as  a 
variant  of  certain  well-known  "upwind" 
finite  difference  methods,  see  [IIa4].  Many 
variants  of  (5.7)  with  improved  per¬ 
formance  in  certain  cases  may  he  obtained 
by  keeping  the  mass  matrix  and  e.g.  the 


We  shall  present  the  DG-metbod  for  a 
scalar  conservation  law: 

2 

(6.1a)  u.  +  E  f.(u)  -0  in  1R2x  |R  ( 

l  j=1  1  Xj  + 

(6.1b)  u(  •  ,0)  =  Xp  in  1R2, 
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where  the  f-:  fR  -+  K  are  smooth  fluxes  and 
•> 

uQ  6  1^(01  )  has  compact  support.  The 

DG-method  for  (6.1)  reads  os  follows:  Find 
Uc  V  such  that  U  =  U  |  g  satisfies 


(6.2)  S  {/  (Ut+Ef.(U)  ) 
KeTn  K  1  1  xi 

(v  +  £(vt+Ef!(U)vv  ))dxdt 
1  i  1  xi 


with  ^  <  a  <  2.  Note  that  here  we  use  a 

variant  of  the  artificial  viscosity  e  used 
above. 

Remark.  Note  that  if  n^  =  (-1,0,0) 
corresponding  to  the  "bottom"  of  K.  then 

rK(U)  =  - 1  (u;  ♦  u») 

+  ^  (U“  -  U“)  =  -u" 


+  /  C  VU-Vvdxdt 
K 

/  (FK(U)-  f(U).nK)vKds)=0 
dK  K  K  K 

Vv  6  Vn. 

where  n^  is  the  outward  unit  normal  to 
K,  Vv  =  (Vv,  vt), 

VK  v  I K’ 

f(U)  =  (U,fj(U),f2(U))% 

(6.3)  FK(U)-i(f(UK)  +  f(UK))-nK 

+  ^I\(^K  "  ^ K ' ^  ^  (x,k)  ^ 

where  K1  is  an  element  sharing  the  face  S 
common  to  K  and  K‘  containing  (x,t), 
and 

CK  =  2  if  11 K  “  *  (1’°-0)> 

=  Ch“ '  otherwise, 

with  C  a  positive  constant  and  7  >  0 
small.  Further, 

#= c,(k-2  +  h"2  s  quo2)-* 

i  =C2h"(|Ut*£(.(Ui  I 

+  sck|,,K'”K’l/hK  in  K’ 


where  as  above  U1)  =  lim  U(tn  ±  s),  so 

*  „  .  A  +  I* 


((F(UK)-f(UK())-nK)vK 

=  (u ;  -  u?)vj . 

which  gives  the  usual  jump  term.  Further, 
if  ~  (1,0,0),  then 

(FK(U)-f(UK))-nK  =  0, 

which  means  that  there  is  no  coupling 
forward  in  time.  Note  further  that  the  F^ 

are  basically  the  Lax- Friedrichs'  fluxes. 

For  q  =  0,  (6.2)  gives  an  implicit  variant 
of  the  Finite  Volume  method.  The  accuracy 

of  (6.2)  is  formally  of  order  <?(h(1+*),  thus 
of  accuracy  higher  than  one  if 
q  >  1 .  in  particular,  the  use  of  the 
Lax-Friedricks'  fluxes  (6.3)  does  not 
degrade  the  accuracy  to  first  order  if  q  >  1. 

It  is  possible  to  prove  convergence  of  the 
method  (6.2)  with  q  >  0  using  the  same 
method  of  proof  as  for  the  SD-method  with 
continuous  basic  functions  in  space  based 
on  the  DiPerna  uniqueness  result  for 
measure  valued  solutions  of  scalar 
conservation  laws  (see  (Sz2),  ( J Jj).  ■ 

7.  ADAPTIVE  CSD-METHOl)  FOR 
BURGERS'  EQUATION 

7.1.  Introduction 

In  [EJi-8],  [J2],  (J5)  and  (JSz2l,  we  prove  a 
posteriori  error  estimates  and  formulate 
corresponding  adaptive  algorithms  for 
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linear  convection-diffusion  problems,  the 
linear  wave  equation,  and  systems  of 
conservation  laws  in  one  space  dimension, 
respectively.  Further,  in  [HJ]  we  discuss 
extensions  and  present  computational 
results  for  compressible  flow  in  two  space 
dimensions.  As  far  as  we  know,  our  results 
are  the  first  to  show  that  reliable  and 
efficient  adaptive  error  control  based  on  a 
posteriori  error  estimates  is  possible  for 
finite  element  methods  for  hyperbolic 
problems.  Of  particular  interest  are  the 
results  on  adaptive  finite  element  methods 
for  systems  of  conservation  laws,  which 
seem  to  open  a  large  area  of  application.  In 
the  proofs  of  the  a  posteriori  error 
estimates  we  use  our  new  strong  stability 
concept  in  a  crucial  way;  using  classical 
stability  concepts  it  appears  to  be 
impossible  to  obtain  useful  a  posteriori 
error  stimulates  for  hyperbolic  problems. 
The  case  of  conservation  laws  presents  a 
remarkable  example  of  the  utility  of  the 
new  concept  of  strong  stability:  In  this  case 
the  linearized  dual  problem  is  indeed 
strongly  stable,  but  is  unstable  vvitli  the 
classical  weak  stability  concept  cor- 

rponnnftintr  tn  foM  that  ef\1  n  t  irt»>e  rtf 

»  VO  vv  iti\  v  tmvv  OWatviViH-  \/» 

conservation  laws  do  not  in  general  show 
continuous  dependence  U,  with  respect  to 

change  of  initial  data  or  right  hand  side  in 


To  illustrate  the  essential  points,  we  shall 
in  this  section  indicate  the  proof  of  an  a 
posteriori  error  estimate  for  a  finite 
element  method  for  a  scalar  conservation 
law  in  one  space  dimension  (Burgers' 
equation).  For  more  details  and  the 
important  extension  to  the  case  of  systems, 
we  refer  to  [JSz2]. 

7.2.  An  a  posteriori  error  estimate  for  a 
finite  element  method  for  Burgers'  equation 

We  consider  Burgers'  equation:  Find  the 

scalar  function  uc  —  ut(x,l)  such  that  for 
x  c-  IR,  t  >  0 


(7.1b)  uf(x,t)  0,  x  ->  *oo,  t  >  0, 

(7.1c)  ue(x,0)  =  Uq(x),  xeH, 


where  u0  e  L  (W)  has  compact  support, 

and  c  is  a  small  positive  constant..  For 
e  >  0  this  problem  is  uniquely  solvable.  As 

e  tends  to  zero,  the  solution  ue  will 
converge  to  a  limit  u,  the  entropy  solution 
of  the  inviscid  Burgers'  equation 
corresponding  to  (7.1)  with  c  =  0.  Even  if 
the  data  is  smooth,  u(  •  ,t)  may 

become  discontinuous  in  finite  time, 
corres|X)nding  to  the  development  of 
shocks.  The  entropy  condition  states  that 
at  shocks  (discontinuities),  we  have 

u"(x,t)  >  u+(x,t),  where 

u±(x,t)  =  lim  u(x  +  s,t) 
s-*0 


is  the  left-hand  and  right-hand  limit, 
respectively.  This  means  that  close  to 

shocks  (-^r  may  be  very  large  negative 

(but.  will  be  bounded  above).  In  such 
a  case  the  linearized  problem 

(7.2a)|f.|-(ut(/>)  -  c  fty '  “  0,  in  Q, 

(n  o\  - 

(7.2b)  <f>{x,  t  )  -a  0,  x-»oo 

(7.2c)  tf(x,C)  =  0Q(x)x  €  IR, 

where  Q  =  R  *  !R+  obtained  by  linearizing 

(7.1a)  around  the  solution  ir,  is  unstable 
in  L9,  since  multiplicating  (7.2a)  with  (f> 

and  integrating  with  respect  to  x  leads  to 

2HT^2(x’t)dx 


♦  «/ 
ill 


ox 


“  dx  —  -  2  /  0“(x,t)dx, 


where  the  right-hand  side  may  be  large  if 

^7  is  large  negative,  which  corresponds  to 
the  fact  that  (7.1)  is  not  continuous  in  Ly 
with  respect  to  I^-perturbations  of  initial 
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data.  This  argument,  seems  to  indicate  that 
(i  posteriori  error  control  in  L2  for 

approximate  solutions  of  (7.1)  would  be 
impossible  in  the  presence  of  shocks.  The 
remarkable  fact  is,  however,  that  such  an 
error  control  in  fact  is  possible  to  establish, 
if  we  use  a  proper  stability  concept  and  use 
the  Galerkin  orthogonality  intrinsic  in  the 
finite  element  method,  which  we  now 
proceed  to  demonstrate. 

The  a  posteriori  error  estimate  in  Lj,  for  a 

finite  element  method  for  (7.1)  to  be 
presented,  may  be  extended  to  the  case  of 
system  of  conservation  laws  in  one  space 
dimension.  This  yields  a  way  out  of  the 
stalemate  position  of  the  classical  approach 
to  conservation  laws,  based  on 
L, -continuity,  which  is  limited  to  the 

scalar  case. 


For  simplicity,  we  shall  consider  a 
"semi-discrete"  finite  element  method  for 
(7.2)  of  the  form  of  "exact  transport  + 
L9- projection",  cf.  [J4].  The  extension  to 


the  fully  discrete  case  with  discretisation  in 
space-time,  using  space-time  elements, 
follows  the  same  principles,  see  (JSz3j. 


Let  0  tp  <  tj  <  t.y  <...  be  a  sequence  of 

discrete  time-levels  and  let,  for  eacli  time 
interval  l(1  --  (tn_p  tn),  a  finite  element 


space  V  C  II 1  (K)  be  given,  consisting  of 

piecewise  linear  continuous  functions  on  a 
finite  element  subdivision  T  ~  (Kj  of  IK 


For  simplicity,  we  shall  assume  h  to  be 
independent  of  x  and  t ,  but  this  is  not 
essential.  We  shall  consider  the  following 
finite  element  method  for  (7.2):  Find  Uj 


such  that,  for  n  =  1,2,..., 
satisfies  on  IR  *  I  , 


uh 


dru 


(7.3a)-^— 


0 


1? 


0, 


(7.3b)  Ujt(x,t)  ->0,  x-*±®,t>0 

(7.3c)  W|+1(x,tn_l)=l>nuj'J(x,tn_i  )  ,xgR, 


where  c  ~  Ch, 

W  “  !‘“*  V*’ 

U 

u“(x,0)  =  u0(x),  and  Pn:  II 1  (®)  -  Vn  is 
the  L^-projcction  defined  by 

(7.4)  /  l>  vw  dx  =  /  vw  dx  Vw  6  V  . 

K  n  K  n 

Note  that  (7.3)  corresponds  to  a  method  of 
the  form  "exact  transport  +  projection", 

similar  to  (for  instance)  the  Godunov 
method,  cf.  [J4j.  Note  further  that  the 

viscosity  coefficient  e  in  (7.3)  is  chosen  as 

i  -  Ch,  which  makes  (7.3)  at  most  first 
order.  Other  more  sophisticated  choices  of 

e,  with  e  depending  on  the  residual  of 

(shock -capturing  artificial  viscosity)  are 
possible,  giving  higher  order  methods,  see 
above. 

We  shall  now  prove  an  a  posteriori  error 
estimate  in  L^L.,)  for  the  discretization 

error  c  -  u£  -  u^,  where  if  satisfies 

(7.1)  with  c  =  c,  which  appears  to  lie  or 
order  <?(h~)  in  smooth  parts  and  of  order 
^(h  J)  in  the  presence  of  shocks.  To  obtain 
a  complete  estimate  for  if  -  Uj ,  we  would 

also  need  to  estimate  if  -  <f ,  i.e.,  the 
effect  of  changing  the  viscosity  coefficient 

in  the  continuous  problem  (7.1)  from  e  to 
<:  We  refer  to  (EJ7-8)  and  (.IS?, 2)  for 
details  in  this  regard. 

Let  now  T  ~  >  0  be  a  given  final  time, 

and  let  us  seek  an  a  posteriori  error 
estimate  for  the  quantity  ||e||j  where 

Q  =  St»  (0,T).  To  this  end,  we  introduce 
the  following  linearized  dual  problem,  the 
stability  properties  of  which  are  crucial: 


I -.111 


r,„  \  „  dd)  -  C?''i/> 

^.5a)-  ;ir  ~a  Tjt  -  c  -l-  =  e  , 


cfiT  TJx 


i  n 


(7x‘ 


(7.5b)  0(x,  t)-*0  .  x-tAoo ,  0<ct<ci', 


(7.5c)  *(x,T)=0,  xe  R, 

Q, 

where  a  —  (u(  +  t*jj)/^-  Multiplying  (7.5a) 

by  e,  integrating  by  parts  over  each 
I  ,  n  =  1,.  .,  N,  and  using  that 

wo  obtain 


following  strong  stability  estimate  for  the 
solution  <!>  of  the  dual  problem  (7.5): 


Lemma  7.1.  Suppose  is  bounded  from 

above.  Then  there  is  a  constant  C  such 
that  the  solution  <f>  of  (7.5)  satisfies 


f 


n=  l  IR 


Vl>dx' 


so  that,  by  (7.1a)  with  c  -  c  and  (7.3a), 
using  the  defining  property  (7.-1)  for  P  : 


(7.6)  ||e|| 


MQ) 


(1  -  P„)«x,  *„_,)<tx. 


To  estimate  the  quantity 

(I  -•  Pu)0(  •  ,t  j ),  we  shall  use  the 


Proof.  Multiplying  (7.5a)  by  -<  —t , 

integrating  over  Q,  and  integrating  by 
parts,  we  get 


which  proves  the  lemma  by  a  (irbnva.ll 
inequality,  since  the  third  term  on  the 
left-hand  side  integrate  to  zero.  a 


We  now  turn  to  the  error  representation 
(7.6).  Using  the  standard  estimates  for  the 
(^-projection  1% 


-p^iK-VPH, 


J‘> 


<  Ch2 


dxZ  n  1 


l-.ll 


we  get  by  Cauchy's  inequality 

Ml, ,(«)-[  11(1  -  ,>i,)u|’,(’''n-l)|li!,  kii 

it  n—  i  ^ 


■h.4  N  „  ... 

dx'  11  1 


Ch 
7TT? 


n 


£ 

n=l 


where  k(J  =  t|(  -  tn_  j.  If  wc  now  assume 
that  kn  >  Ch  for  some  positive  constant 

0,  and  recall  that  c  -  Ch,  we  have  that 


IN 


2 

L2(Q) 


<  CA 


N  t  1/2 

^jlK^W’ViJHlVnJ  ’ 


where 
A  = 


r  N  „  .,2 


il=l 


>,  HHt-Vi) 


chi 


1/2 


mav  he  viewed  as  an  approximation  of 
.-2  , 

Hr  ---4 II,  ,  cf.  below.  Using  I.emma 
4.1 ,  we  thus  conclude  that 


estimating  the  projection  error  (1  -  P 
as  follows  ([JS/,2]); 

<  Cl.2||l>2„-(.,tii_1)||  ,, 

L 

where,  with  K  =  (Xj,x2)  e  I'n_p 
Dhuh(x,Ln- 1 )  I K 


=  h~^  max 

1=1,2 


r^U 


(hi  v  i  n-1 


+  max 
K 


(h\  r 

%».■•*  It  I - —  /  V  .  I  I  lio  ilium  in  llu. 

.  ‘(At  v'i’n  -r*  . . . . 

tfu," 

derivative  at  (xj,  t(J  j).  Extending 
l)“iij”  to  (0,T)  by  defining 


(7.»)  IMMq,  <  CIO  -  lMQ) 


0|"»h(x.U  =  0"»|",0'.ln-P  ,m  15  *1). 


where,  for  convenience  of  notation,  we  have 
defined  for  teln, 

(I  -  P)ujJ(x,t)  =  (I  -  Pn)ujj(x,  tn_,) 


we  can  thus  express  the  a  posteriori  error 
estimate  (7.0)  alternatively  as  follows: 


(7.1°)  IMIL  j(Q)  <  Cllh“Dhuhll|l.)(Q) 


so  that 

lie  -  ,>>»r,Hj.2(Q) 

Wo  have  thus  arrived  at  the  surprisingly 
simple  a  posteriori  error  estimate  (7.9), 
which  can  be  made  more  concrete  by 


Some  comments  are  in  order.  First,  the 
assumption  used  above,  that 

;»2  i 

A  ss  || f  Jyllj  can  easily  be  avoided. 

Ox "  Ty) 

using  also  the  presence  of  the  other  terms 
in  the  strong  stability  estimate  (7.7),  see 
[JSz2]  for  details.  Secondly,  the  a  posteriori 
estimate  (7.10)  appears  to  be  optimal,  i.e., 
in  particular  second  order  accurate  for 
smooth  solutions.This  is  a  consequence  of 

the  choice  of  c  as  i  —  Ch,  which  itself 
would  correspond  to  an  0{\\)  perturbation 
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for  smooth  solutions,  and  thus  reduce  the 
accuracy  of  the  total  approximation  to  first 

order.  With  the  more  elaborate  choice  of  e 

as,  for  instance,  c  =  C  max(h  R(u^), 

h“/2),  where  R(uj )  is  the  residual  of  u^ 

defined  below,  which  is  a  typical  choice  of 
the  artificial  viscosity  in  the  SD-method 
(see  above),  we  would  obtain  a  method  of 

total  accuracy  <?(h^/2)  for  smooth 
solutions  satisfying  the  following  a 
posteriori  error  estimate  (cf.  (1.5)) 

(7-n)  |MiMQ) 

<  C||min(l,  hl/2R(uh)||L^Qj 
or 

(7.i2)  INI^qj 


Remark  7.1.  Note  that  the  estimates  (7.13) 

indicate  <?(Vh)  accuracy  (which  is 
optimal!  in  the  presence  of  shocks  since,  in 
an  t?(h) -neighbourhood  of  the  shock,  the 
integrands  on  the  right-hand  side  would  be 
of  order  ^(1). 

Remark  7.2.  The  dual  problem  (7.5)  may 
be  I^-unstable,  since  multiplication  of 

(7.5a)  with  <p  gives 


"5m'^2^dx 


i  /  757  02(x,t)dx  +  /  e(x,t)0(x,t)dx, 


da 

which  may  lead  to  instability  if  ^  is  large 

negative,  cf.  the  discussion  above  for  the 
linearized  problem  (7.2). 


<  Cjjniin( i ,  li3/2D2ujj)||L^Q), 
where  we  define 

ll<"b>  =  (‘ -p„)uh('-‘, ,-])/•>  0,1 

We  summarize  the  results  obtained  for 
Burgers'  equation  as  follows: 

Theorem  7.2.  Let  u^  be  the  solution  of 

(4.3)  and  uc  that  of  (7.1)  with  c  =  e  = 
Ch  and  k](  >  Ch.  Suppose  that 

~  2  uh^  's  1>ou,K*C(l  from 

above,  and  that  uf  and  Uj  are  bounded. 
T hen  tiiere  are  constants  C  such  that 

(7.13)  l|u‘-“hllI,2(Q) 

<  C||<1  -  PKI^g,  <  ClhVuil^g,. 


For  analogs  of  (7.13)  for  full  discretizations 
of  (7.1)  and  extensions  to  systems  of 
conservation  laws  in  one  space  dimension, 
we  refer  to  [JSz3l,  where  we  also  consider 

da 

the  case  of  rarefaction  waves  with  ^ 

possibly  large  for  t  small,  (757  <!/*■)> 

using  a  weighted  norm  technique.  The  a 
posteriori  error  estimate  of  Theorem  7.2 
may  be  extended  to  systems  of  conservation 
laws  in  one  dimension  under  appropriate 
assumptions  including  the  presence  of 
shocks,  and  rarefaction  waves.  As  far  as  we 
know,  these  results  are  the  first  to  show 
that  a  posteriori  error  control  for  systems 
of  conservation  laws  is  possible.  To 
establish  the  crucial  stability  estimates  in 
the  system  case  corresponding  to  Lemma 
7.1,  diagonalization  together  with  a 
weighted  norm  technique  is  used. 

The  techniques  for  proving  a  posteriori 
error  estimates  for  conservation  laws 
indicated  above  may  formally  be  extended 
to  systems  of  conservation  laws  in  several 
dimensions,  leading  to  a  posteriori  error 
estimates  of  e.g.,  the  form  (0.10),  if  the 
corresponding  linearized  dual  problem 
satisfies  strong  stability  estimates 
analogous  ,o  (7.7).  In  [II J]  we  give 
computational  results  for  the  corresponding 
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adaptive  algorithms  in  the  case  of 
time-dependent  compressible  flow  in  two 
dimensions.  Below  we  present  a  correspond¬ 
ing  result  for  a  stationary  shock  reflection 
problem.  The  question  if  the  linearized  dual 
problem  satisfies  the  strong  stability 
estimates  in  the  case  of  systems  of 
conservation  laws  (or  the  incompressible 
Navier-Stokes  equations)in  several 
dimensions  is  theoretically  very  complex, 
but,  as  indicated,  may  probably  be  tested 
computationally.  We  plan  to  give  more 
details  on  this  topic  in  future  work. 

8.  NUMERICAL  RESULTS 

In  this  section  we  present  some  numerical 
results  for  SD  and  CSD-methods  applied  to 

(8.1)  stationary  convection-diffusion 

(8.2)  non-stationary 
convection-diffusion 

(8.3)  the  incompressible  Navier-Stokes 
equations  with  free  boundary 
(fountain  flow) 

(8.4)  the  compressible  Euler  and 
Navier-Stokes  equations 

9.  CONCLUSION.  PROSPECTS  FOR 
THE  FUTURE 

We  have  given  an  overview  of  the 
SD-method  as  a  general  unified  approach 
to  CFD  based  on  the  principles:  modified 
Galerkin  ■>  space-time  finite  elements. 

The  SD-method  may  be  viewed  to 
generalize  all  the  main  classical  techniques 
of  CFD  such  as  finite  difference,  finite 
volume,  particle  and  shock-fitting  methods. 
The  SD-method  proposes  a  solution  to  the 
fundamental  problems  of  design  of  artificial 
viscosity,  combination  of  Eulerian  and 
Lagrangean  approaches,  adaptive  quanti¬ 
tative  error  control,  and  of  course  is 
applicable  on  general  unstructured  meshes. 

The  method  appears  t.o  have  a  strong 
potential,  in  particular  in  adaptive  form. 
Various  features  of  the  SD-method  are 
today  used  in  several  commercial  and 
research  codes.  A  rapid  development  into 
full  exploitation  of  SD-features  such  as 
space-time  elements  and  adaptivity  in 
these  codes  is  to  be  expected. 
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Standard  Galerkin 


Classical  artificial  diffusion 
Upwind  method 


The  sd-inethod 


/?  •  V  —  eAu  =  0  in  (0,  l)2, 
u  =  1  if  x  =  0  or  y  —  1 
u  =  0  if  y  =  0  or  x  =  1 


0  =  (2,1) 


FIG.  1  -  Comparison  of  Standard  Galerkin,  Classical  artificial  diffusion  and  the  SD-method 
for  a  stationary  convection-diffusion  problem  with  internal  layer  and  boundary  layer 


The  CSD-method  without,  shock-capturing 


The  CSD-method  with  shock -capturing 


FIG.  2  -  The  CSD-method  for  linear  convection  problem  with  L>2-projection  with  and 
with  shock-capturing  artificial  viscosity  at  each  time  step  (level  curves  of  initial  data  and 
CSD-solution  after  5,  50  and  100  time  steps) 
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Stabilized  methods 

Example:  Scalar,  steady  advection-diffusion  equation. 

Cu  ='  a  ■  V  ii  -  V  ■  kVu  —  /  onllC^ 

Assume  V  •  a  =  0,  k  >  0. 

Consider  u  =  0  on  I  (Dirichlet  problem). 

Remark: 

<v  =  hja|/(2fv)  =  element  Pedet  number 
h  -  element  mesh  parameter 

Interested  in  0  <  ft  <  oo;  a  »  1  is  viewed  as  “hard.” 

Vk  -  typical  finite  element  space  of  continuous,  piecewise 
polynomials  of  order  k. 

Point  of  departure:  Galerki.Vs  method 
Find  uk  €  such  that  for  all  €  Vh, 

B(wh,uh)  =  L(vju) 

where 

U(w\  uh)  '=  /  (-a  •  Vwh  uh  +  Vwh  ■  kVu*)  dQ 

Jo 

L{wh)  =‘  I  whf  dQ 
Ju 

Remark:  Galerkin’s  method  possesses  poor  stability  proper¬ 
ties  for  a  >  1.  Spurious  oscillations  are  generated  by  unre¬ 
solved  internal  and  boundary  layers. 

Gnlerkin/least-scpinres 

B(wk,uk)  +  J  klwh  r(Cuk  -  /)  dQ  =  L(u/k) 


SUPG 

B(wh,u'‘)  -|-  V*  /  (i  Vwk  t(Cu'‘  -  /)  d.Q  =  L{wk) 

r  "  S2' 

Classical  artificial  diffusion 

B(wh,'ih)  +  [  Vtu'1  •  KkVuh  dQ  -  L(wh) 

J  n 

Notations: 

ft*  =  domain  of  eth  element. 

Kh  =  artificial  diffusivity,  typically  0(/r). 
r  =  parameter  determined  by  convergence  analysis. 

=  0(/i/|a|)  for  ft  large. 

=  O {hr  I'k)  for  a  small. 

Remarks: 

1.  The  additional  terms  improve  upon  the  stability  of 
Galcrkin’s  method. 

2.  The  classical  artificial  diffusion  method  amounts  to 
overkill.  Accuracy  is  limited  to  first-order,  independent 
of  k. 

3.  Gnlerkin/least-squarcs  and  SUPG  arc  satisfied  if  uh  *—  u 
(“residual  methods”);  unlike  classical  artificial  diffusion. 

4.  Good  stability  and  higher-order  accuracy  are  combined  in 
Galerkin/least-squares  and  SUPG. 

•  Global  error  estimates  for  smooth  solutions: 
k  +  4  (at  least)  in  Lt(Q),  usually  k  1  in  practice. 

»  For  rough  solutions,  same  rates  are  observed  outside 
of  small  neighborhoods  of  layers  (“interior  estimates,” 
or  “localization  results”). 

•  Interior  estimates  are  impossible  for  Galcrkin.  Layers 
create  global  pollution. 

5.  Galerkin/least-squares  is  conceptually  simpler  than  SUPG 


and  slightly  easier  to  analyze. 
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Space-time  formulations 

Initial-value  problem  for  ivection-diffusion  equation: 

Ctu  *==  u,t  +  Cu  =  /  on  flx]0,T[ 
u(x,0)  =  uo(*) 

ti  =0  on  l'x]0,r[ 

Discontinuous  Galerkin  method  in  time 

Space  time  (i.e.,  flx]0, 7'[)  is  divided  into  time  slabs, 

nx]tnitn+i[,  where  0  =  to  <  ti  <  •  ■  •  <  tv  =  T. 


V"  =  piecewise  polynomials  of  order  k,  continuous  in  x,  but 
discontinuous  across  time  slabs. 

Point  of  departure:  Galerkin’s  method 

Find  uh  —  u^fx,  ,*)  such  that  for  all  to*  =  u>*(x,t), 

Bn(wh,uk)  =  Ln(wh),  n=-.  0,1 . N-  1 

where 

Bn{wh,uh)  ‘=  l‘ "+’  [  wkuk  dSl  +  B(wk,uh ))  dt 

+  [  w‘(*;+i)  AM  w 

Ln(uk)  ='  /‘"H  Hvt)  dt  +  /  uifc(t+)  «*(t-)  dfi 

JO 


Remark:  Continuity  of  the  solution  across  time  slabs  is  weakly 
enforced. 

Generalization  of  the  other  methods  proceeds  analogously  to 
the  steady  case.  For  example, 

Galerkin/least-squares 

B„(wh,uk)  +  J'”'  £  f  C,io1'  r{Ctuk  -  /)  dSidt  =  L„(u>*) 
Remarks: 

1 .  The  mathematical  convergence  theory  is  virtually  identical 
to  its  steady  counterpart. 

2.  Same  error  estimates  and  localization  results  hold  in  terms 
of  the  order,  k,  of  the  space-time  elements  employed. 

3.  The  issue  of  time  integrator  is  obviated  by  the  choice  of 
space-time  interpolation. 

4.  Unconditional  stability  is  achieved  for  any  choice. 

5.  Gives  rise  to  system  of  linear  algebraic  equations  on  each 
time  slab. 


«'‘(<o)=<  )  =  «o(*) 


*6  ft 
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B(WhtVh)^  L{Wk) 

B{Wk ,  VA)  J (-VWh  ■  A  V*  +  VVV'1  ■  KVVk)  dil 
L(Wh)'~  I  Wh  ■  T  dft 

J 12 

Galerkin/least-squares 

B(WktVk)  +  y\  /  C\Vh  r(CVh  -  F)  dQ  =  L{Wh) 

*7'  •'«* 

Remarks: 

1.  t  —  my.  to  symm.,  >  0  (Il.-Mallet,  1986). 

2.  Simple  arguments  reveal  that  r  should  not  be  diagonal, 
even  in  Id. 

Space-time  formulations  are  developed  in  identical  fashion  to 
the  scalar  case: 


Aq  =  LL1  ,  Cholesky  factorization 


Ao  =  block-diag  (/10,  •  •  • ,  do) 


Remark:  If  do  is  only  positive  jcmi’definite,  then  1  is  to 
be  understood  as  the  inverse  on  the  non-degenerate  subspace, 
e.g. 

* *>=[«  j]-then vd=  [co‘  ; 
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Incompressible  Euler  and  Navier-Stokes  Equations 
ns  a  Symmetric  Advectivc-Diflfusive  System 


V  ^ 


«L 

U2 

“3 

P 


Ao  =  p 


1 

0 

0 

0 


0  0  0 
10  0 
0  1  0 
0  0  0 


Ai 


t(ji4o  + 


e, 

0 


1  <  i  <  3 


e,  =  cartesian  basis  vector 


•ftfll  =  K22  =  H33  —  I'Aq  , 
Kij  =0  ,  i  ^  j 


2.  The  formulation  simplifies  considerably  due  to  the  sparsity 
of  the  arrays.  See  Szepessy,  1987,  and  Hansbo-Szepessy, 
1990,  for  an  analysis  and  numerical  results  for  a  related 
method  at  high  Reynolds  number,  and  Johnson-Snrinen, 
1986,  for  analysis  of  incompressible  Navier-Stokes  via  re¬ 
lated  formulations. 


3.  Assume  V),  =  0  and  Ai  = 


0  ei 

ef  0 


Then,  the  method 


becomes  a  nonaymmelric  Stokes  solver,  and  is  convergent 
for  all  continuous  pressure  interpolation  (i.e.,  no  BabuSka- 
Brezzi  condition).  Galcrkin/least-squares  and  related  ap¬ 
proaches  to  the  Stokes  problem: 

Brezzi-Pitkaranta,  1984 
H.-FVanca-Balestra,  1986 
H.-Pranca,  1987 


Brezzi-Douglas,  1988 
Franca-H.-Loula-Miranda,  1988 
Fianca-H.,  1988 
Pierre,  1988,  1989 
Franca, 1989 
Douglas-Wang,  1989 
Duran-Nochetto,  1989 
Sylvester-Kechkar,  1990 
Stenberg,  1990 
Franca-Stenberg,  1991 
Franca-H. -Stenberg,  1991 


Remarks: 

1.  r  is  very  simple  in  general.  Assuming 


r  0  0  0 

0  r  0  0 
0  0  t  0 
0  0  0  A 


Compressible  Euler  and  Navior-Stokos  equations 
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(U)  U,i  -t-  A  •  VU  -  V  •  KVU  =  f 

V  -  (p,  pill,  p»2,  pu3,  pe)r 

Remarks: 

1.  Neither  /I  nor  IC  is  symmetric  or  definite. 

2.  Classical  Li(U)  stability  estimates  are  derived  by  taking 
tlie  dot  product  of  (U)  with  U.  This  does  not  even  make 
dimensional  sense, 

5  5^a±JS+fi]+- 

??? 

This  suggests  that  th"  f,2(f2)-inner-product  structure  is 
inappropriate  for  compressible  Navier-Stokes  and  conse¬ 
quently  so  would  bo  a  classical  type  Calcrkin  formulation. 

Entropy  variables 

Godunov  1062,  Mock  1980,  Marten  1983,  Tadmor  1981,  Dutt 
1985,  H.  et  al.  1980,  Johnson  ct  al.  1987. 


11  =  H(U)--ps 

s  =  In  (^)  )  ~  non-dimensional  entropy 

q  r r 

V  =  =  ontiopy  variables 

Remarks: 

).  H  is  a  convex  function  of  U.  Thus  U  =  U(V)  is  a  well- 
defined  change  of  variables  which  transforms  (U)  to  (V). 

2.  The  dot  product  of  (V)  with  V'  results  in  the  Clausius- 
Duhem  inequality: 

0  =  V  •  (yi0V,  -1-  A  •  VV  -  V  ■  (KW)  -  >') 
implies 

(p’/).t  +  V  •  (p»;u)  -f  V  ■  -1-  >  0 

3.  The  space-time  formulation  inherits  this  property.  Replac¬ 
ing  H"*  by  Vk  results  in  a  global  statement  of  C.D.I. 

4.  In  practice,  the  term 

Wk  ■  (A0Vk4-A-  VV'1)  = 

Wh  •  ( U(Vklt  +  V  ■  JF(U(Vk)) ) 

appearing  in  B„( \Vh)  is  integrated-by-parts  over  each  time 
slab.  Global  conservation  is  attained  even  when  approxi¬ 
mate  element  quadrature  is  employed. 
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Nonlinear  operators  and  shock-capturing 
Remarks: 

1.  Galerkin/lcast-sqnares,  SUPG  are  linear  methods. 

2.  They  produce  approximations  to  discontinuities  like 


3.  No  linear  higher-order  accurate  method  will  produce  mono¬ 
tone  profiles. 

4.  The  idea  is  to  introduce  nonlinear  operators  in  order  to 
control  oscillations  about  discontinuities,  but  not  upset 
higher-order  accuracy  in  smooth  regions  ("locally  first- 
order”).  H.-Mallet-Mizukami  1980,  H.-Mallet  1980, 
Dutra  do  Carmo-Galeao  1987,  Johnson-Szcpessy  1986- 
1989. 


*  PVlW,‘  ■A()V(Vk  dQdt 

0  =  0{VK)  is  a  scalar 

=  a  non-dimensional  space-time  element  gradient,  operator 
/to  =  block-diag(/t0, . . . ,  /to) 

Examples: 

i.  o-iAV-n./i-i.v.v'i, 

where 

f  =  block-diag(r, . . . ,  r) 

|A|T  =(XrX)i 
\X\t  =(X-fA)i 

0.  ^2\C,V--r\\/\ViV^M 

where 

|AU,  =  (X-A0A)i 
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Conclusions 


Structure  of  the  Left-Hand-Side  Submatrices 


1.  a)  Galerkin/lcast-squares  is  an  effective  method  for 

advective-difi'usive  systems. 

b)  Nonlinear  shock-capturing  operators  arc  essential 
when  discontinuities  and  unresolved  sharp  layers  are 
present. 

c)  Mathematical  convergence  proofs  have  been  estab¬ 
lished  fur  all  linear  and  some  nonliucu, 

2.  Focal  points  of  research: 

•  Compressible  Fade:  and  Navier-Stokcs  equations. 

•  Generation  and  adaptive  refinement  of  unstructured 
meshes,  especially  in  three  dimensions. 

•  Design  and  analysis  of  effective  shock-capturing  oper 
ators. 

3.  Current  developments: 

•  More  attention  paid  to  turbulence  modelling. 

•  Commercially  available  compressible  flow  codes. 

•  Application  to  more  complex  systems,  in  particular, 
chemically  reacting  flows,  combustion,  MUD. 

•  Increased  emphasis  on  iterative  strategies  for  parallel 
architectures. 

•  Further  mathematical  analysis  of  algorithms  and  the 
development  of  design  principles  based  upon  the  math¬ 
ematical  theory  of  finite  elements. 

Multi-Element  Group  Partitioning 
(Domain  Decomposition) 

•  Algorithm  allows  different  solution  techniques  on  different 
subdomains 

•  For  example: 


•  Generalizable  to  arbitrary  number  of  element  groups 


Uncoupled  set  of  symm.,  pos.  def.  nodal  block  matrices 


Fully  coupled  -  stored  in  skyline  column  height  form 

rz=n_ 


Implicit/ iterative  clement  group 


A 


iter  _ 


A, ter 

Ai!" 


ait 


Fully  coupled  -  stored  in  '‘unassembled"  element  file 


Global  System  of  Equations 
•  Tlie  left-hand -side  matrix 


A'*/  no  0 

0  Ai\r  0  A?‘r 

0  0  A\T  A;'r 

o  A,';1'  A'<r  (A£r  +  A&r  +  A“") 


•  flic  right  -hand-side  vector 


i>7" 

b'!‘r 

Kr 


•  The  vector  of  unknowns 


xrxp 

_ifir 


(C 


ilrr 


X 


huity 


Partial  Reduction  of  the  System 


Explicit  element  group:  solve  for  a'11’ 


A\\p  x ””  = 


GMRES  Algorithm 

(Sand  and  Shultz  -  Mallet  et  al.) 


Search  for  solution  x  =  xc  4 -.x,  such  that 


min  116  -  A(xa  4-  z) 

Z€h' 


Implicit/direct  element  group:  statically  condense  xd,r 


Ai?  -  AiUAjYy'Aft 
bfr  btir  -  Ai\'(Ai\Tlbiir 


K  =  Span{r„,  Ar„, .  . .  ,  A*  1  »'n } 


r„  —  b  -  zlan 


♦  Calculate  the  orthogonal  basis  of  K, 


•  Reduce  the  system  of  equations  to 


AjiV  (A^p  +  A&  +  A&cr)  ]  xb'ldv 


(bf p  +  b:j,r  +  b‘2trr) 


Uk  ~  [tti.Uj . «t) 

using  Modified  Gram-Schniidt  orthogonalizalion 
Generate  a  lectangular  upper  Ilcssenberg  matrix,  Ht 


(k  4~  1 )  x  t 


Rewrite  as 


A  x  =  b 


such  that 


AU^V,nH, 


•  Reduce  the  minimization  problem  to 


Pre-preconditioning 


min  || b  -  A(.r„  4-  z)||  = 


Transform  system  to  enhance  convergence  of  iterative  solver 


nun  ||  e  -  II  „  y  ||  = 


•  Diagonal  pre-prcconditioning 


W  =fdiag(A) 


(IV-tAW'-iXVK**)  =  (W-i£) 


»  Block-diagonal  ore-preconditioning 


VV  '=  block(A)  = 


X 


Solve  the  above  problem  using  a  Qli  algorithm 


=  U1  U  (Cholesky  decomposition) 


(U~rAU~i){US)  =  {U-Tb} 


Preconditioning  on  Element  Group  Basis 


A 


Formally  replace  the  system  of  equations  by 

(L^AV^Vx)  =  ( L'b ) 
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•  Preconditioncr 

l  ^  n  i+9)  ;  v  =  n  u <*> 

S  =  >  »  =  TllV 

•  For  implicit/itcrative  element  group; 

Element-by-element  (EBE)  preconditioned 

n«i  L 

Lit,T  <|cf  JJ  .  jjiur  ae(  J-J  t-fe 

c=l  «=n«i 

•  Regularized  element,  arrays 

A'  =  Ac  -  Diag(Ac)  +  I 

•  Gauss-Seidel  EBE  (sum  decomposition) 

Le  +  UC  =  A‘  +1 

•  Nonsymmetric  “ Cholesky "  EBE  (product  decomposition) 

L’U‘  =  Ac 

Supersonic  Flow  over  a  Flat  Plate 


•  CFL  =  25 

•  CPU-time  (sec)  per  linear  sol  re 


Elements 

Direct 

Diagonal 

Block-diag. 

112 

1.5 

1.2 

0.3 

448 

12.2 

6.1 

0.9 

1,792 

102.7 

35.6 

5.0 

7,168 

1,174.4 

175.3 

24.5 

28,672 

. 

628.3 

130.9 

Computed  on  Convex-Cl 


•  Storage  of  iterative  solver  is  substantially  less  than  direct 
solver 

•  Effect  ot  EBE  preconditioning 

•  CPU-time  (sec)  per  linear  solve 
Block-diagonal  prc-preconditioning 


Elements 

No-EBE 

GS-EBE 

NC-EBE 

112 

0.3 

0.3 

0.4 

448 

0.9 

0.9 

1.3 

1,792 

5.0 

4.8 

6.5 

7,168 

24.5 

21.2 

30.4 

28,672 

130.9 

112.0 

263.0 

•  CFL  =  25 
1 ,792  elements 

Block-diagonal  pre-preconditioning 


ed  residua! 


idual 
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«  At  normalized  residual  10  l3; 


Conclusions 


Method 

Time  Steps 

CPU-Time 

Storage 

(min) 

(MByte) 

Explicit 

13,577 

356.33 

0.830 

Direct 

104 

118.05 

9.752 

No-EBE 

147 

17.42 

4.249 

GS-EBE 

132 

14.83 

4.249 

NC-EBE 

132 

16.53 

7.354 

•  Two-element  group  partitions 


•  Block-diagonal  pre-preconditioning  significantly  improves 
the  convergence  of  the  GMRES  algorithm. 

•  Implicit/iterative  algorithm  is  superior  to  both  explicit  and 
implicit/direct  algorithm,  even  for  very  small  problems. 

•  Implicit/explicit  partitioning  is  useful  for  making  optimal 
use  of  storage  and  CPU  resources. 

•  We  expect  implicit/iterative  algorithm  10  exhibit  even 
better  performance  on  large  three-dimensional  problems, 
but  further  research  in  reducing  storage  of  the  element  ar¬ 
rays  is  needed. 


Density,  p  D*sii:ty,  p  Density, 
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Abstract 

In  these  lecture  notes,  we  review  some  of  the  recent  progress'  on  stabilized  finite  element  formulations  used  in 
computation  of  incompressible  flows.  These  stabilization  techniques  are  used  to  prevent  die  numerical  oscillations  ttmt 
might  be  generated  by  the  presence  of  dominant  advcction  terms  or  by  inappropriate  combinations  of  interpolation 
functions  used  for  the  velocity  and  pressure.  The  stabilization  techniques  emphasized  in  these  lecture  notes  are  the 
Galerkin/lcast-squares,  streamlinc-upwind/Pctrov-Galerkin,  and  pressurc-stabilizing/Petrov-Palcrkin  formulations,  all  of 
them  arc  consistent  formulations  in  the  sense  that  an  exact  solution  still  satisfies  the  stabilized  formulation.  Some  of 
these  techniques  arc  based  on  finite  element  discretization  in  both  space  and  time.  Most  of  the  numerical  examples 
considered  are  unsteady  flow  problems,  with  emphasis  on  those  involving  moving  boundaries  and  interfaces,  such  as 
frce-surfacc  flows,  liquid  drops,  flow  past  an  oscillating  cylinder  and  flow  past  an  oscillating  airfoil.  Flow  past  a 
vertically  oscillating  cylinder  mounted  on  springs  is  solved  as  a  simple  but  fundamental  fluid-structure  interaction 
problem. 

Also  reviewed  arc  the  iteration  strategics  employed  to  solve  the  implicit  equation  systems  resulting  from  the  finite 
element  discretization  of  these  flow  problems,  including  those  discretized  by  using  the  space-time  formulation.  In  Utc 
space-time  formulation  the  finite  element  interpolation  functions  arc  discontinuous  in  time  so  that  the  fully  discrete 
equations  arc  solved  one  space-time  slab  at  a  time,  and  this  makes  the  computations  feasible.  Still,  the  computational 
cost  associated  with  the  space-time  finite  element  formulations  using  piecewise  linear  functions  in  time  is  quite  heavy. 
For  large-scale  problems  it  becomes  imperative  to  employ  efficient  iteration  methods  to  reduce  the  cost  involved.  This 
is  achieved  by  using  the  generalized  minimal  residual  (GMRES)  iteration  algorithm  with  the  clustered  element-by- 
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dement  (CHBli)  preconditioned.  The  CEBE  preconditioning  is  a  generalized  version  of  the  standard  clcmcnt-by-clcmcnt 
(EBE)  preconditioning.  In  the  CEBE  preconditioning  the  elements  arc  partitioned  into  clusters  of  elements,  with  a 
desired  number  of  elements  in  each  cluster,  and  (tic  iterations  ore  informed  in  a  clustcr-by -cluster  fashion.  The  number 
of  clusters  should  be  viewed  as  an  optimization  p ammeter  to  minimize  the  computational  cosl  Recently  we  have  been 
performing  this  type  of  computations  in  the  massively  parallel  environments  of  the  Connection  Machines  2  and  5.  For 
these  implementations  we  have  been  so  far  using  diagonal  prceonditioncrs.  We  will  include  some  examples  from  these 
massively  parallel  computations. 

In  these  lecture  notes  we  also  describe  a  new  mixed  CEBE/CC  preconditioning  method  for  finite  element 
computations.  The  CC  (cluster  companion)  preconditioning  method  shares  a  common  philosophy  with  die  multi-grid 
methods.  The  CC  preconditioners  arc  based  on  companion  meshes  associated  with  different  levels  of  clustering.  For  each 
level  of  clustering,  we  construct  a  CEBE  prcconditioncr  and  an  associated  CC  prcconditioncr.  Because  these  two 
preconditioners  in  a  sense  complement  each  other,  when  they  are  used  in  a  mixed  way,  they  can  be  expected  to  give 
better  performance.  In  fact,  our  numerical  tests,  for  two-  and  three-dimensional  problems  governed  by  the  Poisson 
equation,  demonstrate  that  the  mixed  CEBE/CC  preconditioning  tesults  in  convergence  rates  which  are,  in  most  eases, 
significantly  better  than  the  convergence  rates  obtained  with  the  best  of  the  CEBE  and  CC  preconditioning  methods. 


I.  Introduction 

The  purpose  of  these  lecture  notes  is  to  present  a  review  of  our  solution  strategics  for  incompressible  flows  using 
finite  elements.  These  strategies  include  the  stabilized  formulations,  the  space-time  finite  element  approach  to  flow 
problems  with  moving  boundaries  and  interfaces,  iteration  techniques  for  solving  the  implicit  equation  systems 
involved,  massively  parallel  implementations,  and  sophisticated  preconditioning  techniques.  The  description  of  the 
strategics  reviewed  in  these  lecture  notes,  and  the  numerical  results  reported,  have  mostly  been  extracted  from  recent 
articles  by  Tezduyar  ct  alJ*-\  Mittal  and  Tezduyar^  and  Bchr  ct  alJ.  The  numerical  examples  considered  here  arc 
unsteady  flow  problems,  including  those  involving  moving  boundaries  and  interfaces,  such  as  large-amplitude  sloshing, 
liquid  drops,  flow  past  an  oscillating  cylinder  and  flow  past  an  oscillating  airfoil. 

Finite  element  computation  of  incompressible  flows  involves  two  main  sources  of  potential  numerical  instabilities 
associated  with  the  Galcrkin  formulation  of  a  problem.  One  source  is  due  to  the  presence  of  advcciion  terms  in  the 
governing  equations,  and  can  result  in  spurious  nodc-to-nodc  oscillations  primarily  in  the  velocity  field.  Such 
oscillations  become  more  apparent  for  advcclion-dominatcd  (i.c.,  high  Reynolds  number)  flows  and  flows  with  sharp 
layers  in  the  solution.  The  other  source  of  instability  is  due  to  using  inappropriate  combinations  of  interpolation 
functions  to  represent  the  velocity  and  pressure  fields.  These  instabilities  usually  appear  as  oscillations  primarily  in  the 
pressure  field.  In  fact,  there  is  not  much  about  either  of  these  numerical  instabilities  that  could  be  considered  to  be 
inherent  to  the  finite  element  formulation.  Such  instabilities  appear  also  in  the  standard  versions  of  other  discretization 
techniques  such  ns  finite  difference  ami  finite  volume  methods. 

For  the  formulations  considered  in  these  lecture  notes,  the  stabilization  of  the  numerical  method  is  achieved  by 
adding  to  the  Oalerkiu  formulation  a  scries  ol  stabilizing  terms.  These  terms  can  be  obtained  by  minimizing  the  sum  of 
the  squared  residual  of  the  governing  equations  integrated  over  each  element  domain.  This  kind  of  a  stabilization  is 
irttowii  as  the  GLS  (Galcrkin/Lcast-squarcs)  stabilization.  This  abroach  has  been  successfully  applied  to  Stokes  flows**, 
compressible  flows^'*®,  and  incompressible  flows  at  finite  Reynolds  numbers-^.*  1*12  For  time-dependent  problems, 
a  stnc»  implementation  of  the  GLS  stabilization  technique  necessitates  finite  element  discretization  in  both  space  and 
time,  and  therefore  leads  to  a  space-time  finite  element  formulation  of  the  problem.  The  space-time  finite  element 
formulation  has  recently  been  successfully  used,  in  conjunction  with  the  GLS  stabilization,  for  various  problems  with 
fixed  spatial  domains.  We  con  give  as  example  the  work  of  Hughes  ct  al.*-*,  Hughes  and  Hulbcrt14,  Shaxib1®,  and 
Hansbo  and  Szepessy^. 


Perhaps  one  of  the  most  striking  applications  of  the  stabilized  space-time  finite  element  formulation  is,  as  it  was 
first  pointed  out  and  implemented  by  Tezduyar  et  al.2‘3,  in  computing  moving  boundaries  and  interfaces.  The  DSD/ST 
(Dcfonning-Spatial-Domain/Spacc-Timc)  procedure  introduced  by  Tezduyar  ct  al.^*3  serves  this  purpose  and  was 
successfully  applied  to  several  unsteady  incompressible  flow  problems  involving  moving  boundaries  and  interfaces, 
such  as  f tee-surface  flows,  liquid  drops,  two-liquid  flows,  flows  with  drifting  cylinders.  In  the  DSD/ST  procedure  the 
finite  element  formulation  of  a  problem  is  written  over  its  space-time  domain,  and  therefore  the  deformation  of  the 
spatial  domain  with  respect  to  time  is  taken  into  account  automatically.  Furthermore,  in  the  OSD/ST  procedure  the 
frequency  of  remeshing  is  minimized.  Here  we  define  remeshing  as  the  process  of  generating  a  new  mesh,  and  projecting 
the  solution  from  the  old  mesh  to  the  new  one.  Since  remeshing,  in  general,  involves  projection  errors,  minimizing  the 
frequency  of  remeshing  results  in  minimizing  the  projection  errors.  Furthermore,  minimizing  the  frequency  of 
remeshing  increases  the  massive  parallelization  potential  of  the  computations. 


Il  is  iin(X)rtam  to  icalizc  that  the  finite  element  interpolation  functions  me  discontinuous  in  time  so  that  the  fully 
discrete  equations  ire  solved  one  space-time  slab  at  a  time,  and  this  makes  the  computations  feasible.  Still,  the 
computational  cost  associated  with  the  space-time  finite  element  formulations  using  piecewise  linear  functions  in  time 
is  quite  heavy.  For  large-scale  problems  it  becomes  imperative  to  employ  efficient  iteration  methods  to  reduce  the  cost 
involved.  This  was  achieved  in  Liou  and  Tezduyar* 1  by  using  the  generalized  minimal  residual  (GMRES)*^  iteration 
algorithm  with  the  clustered  clcmcnt-by-elernent  (CEBE)  preconditioners.  We  will  review  such  preconditioning 
techniques  later  in  these  lecture  notes. 

Computation  of  time-dependent  incompressible  flow  problems,  over  fixed  spatial  domains,  can  be  performed  by 
using  the  finite  element  discretization  in  space  only,  rather  than  in  both  space  and  time.  In  this  case  we  first  consider 
the  GLS  stabilization  for  the  steady-state  equations  of  incompressible  flows.  Then  in  the  definition  of  the  stabilizing 
terms,  we  replace  the  residual  of  the  steady-state  equations  with  the  time-dependent  ones.  These  stabilizJnp,  terms  are 
added  to  the  Galerkin  formulation  of  the  time-dependent  equations.  If,  at  the  element  interiors,  we  further  neglect  the 
contribution  to  the  weighting  function  from  die  viscous  terms  (it  is  identically  zero  for  linear  velocity  interpolation)  we 
get  a  formulation  with  combination  of  SUPG  (strcamline-upwind/Pctrov-Galcrkin)  and  PSPG  (pressure- 
stabilizing/Pctrov-Galcrkin)  stabilizations.  The  former  prevents  the  numerical  oscillations  caused  by  the  presence  of 
advection  terms,  while  the  latter  allows  one  to  use  equal-order  functions  for  velocity  and  pressure  without  generating 
oscillations  in  the  pressure. 

The  SUPG  formulation  was  introduced  by  Hughes  and  Brooks*6.  A  comprehensive  description  of  the  formulation, 
together  with  various  numerical  examples,  can  be  found  in  Brooks  and  Hughes*7.  The  implementation  of  the  SUPG 
formulation  in  Brooks  and  Hughes*7  was  based  on  Q1P0  (bilinear  vclocity/constant  pressure)  elements  and  one-step 
time-integration  of  die  scmi-discrctc  equations  obtained  by  using  such  elements.  For  hyperbolic  systems  in  general,  and 
compressible  Euler  equations  in  particular,  tltc  SUPG  stabilization  was  first  reported  by  Tc/.duyar  and  Hughes***.  The 
SUPG  stabilization  for  the  vorticity-strcam  function  formulation  of  incompressible  flow  problems,  including  those 
with  multiply-connected  domains,  was  introduced  by  Tezduyar  et  al.*^. 

It  was  shown  that  (see  Brez.zi  and  Pitkaraiua2^,  and  Hughes  cl  al  2*),  with  proper  stabilization,  elements  which  do 
not  satisfy  the  Brez.zi  condition  can  be  used  for  Stokes  flow  problems.  The  Pcirov-Gulcrkin  stabilization  proposed  in 
Hughes  et  al.2*  is  achieved,  just  like  in  the  SUPG  stabilization,  by  adding  to  the  Galerkin  formulation  a  series  of 
integrals  over  element  domains.  The  PSPG  stabilization  term  proposed  in  Tczduyai  et  al.*  is  a  generalization,  to  finite 
Reynolds  number  flows,  of  the  Pctrov-Galcrkin  stabilization  tenn  proposed  in  Hughes  et  al.2*  for  Stokes  flows.  In 
Tezduyar  et  al.*,  the  SUPG  and  PSPG  stabilizations  arc  used  together  with  both  one-step  (Tl)  and  multi-step  (T6)  lime- 
integration  schemes22.  With  the  Tl  scheme,  the  SUPG  and  PSPG  stabilizations  arc  applied  simultaneously.  With  the 
T6  scheme,  on  the  other  hand,  the  SUPG  stabilization  is  applied  only  to  the  steps  involving  the  advcctive  terms,  and 
the  PSPG  stabilization  is  applied  only  to  the  steps  involving  the  pressure  terms.  Both  schemes  were  implemented  in 
Tezduyar  et  al.*  based  on  the  Q1Q1  (bilinear  velocity  and  pressure)  and  P1P1  (linear  velocity  and  pressure)  elements, 
and  were  successfully  applied  to  a  set  of  nearly  standard  test  problems. 

The  clcmcm-by-clcment  (EBU)  preconditioners,  which  arc  constructed  as  series  products  of  element  level  matrices, 
have  been  successfully  applied  to  several  clasps  of  problems2-*'26,  They  can  be  used  effectively  with  the  conjugate- 
gradient  and  GMRES*^  methods,  and  arc  highly  vectorizablc  and  parallelizablc2^27'2**.  They  can  also  be  used  together 
with  the  implicit-explicit  and  adaptive  implicit-explicit  time- integration  schemes26'2**^**.  In  CEBE  (clustered  element- 
by-clcmcnt)  preconditioning**^*,  the  elements  arc  merged  into  clusters  of  elements,  and  the  preconditioned  are 
constructed  as  series  products  of  duster  level  matrices.  In  Liou  and  Tezduyar**,  the  CEBE  preconditioning,  together 
with  the  conjugate-gradient  method,  was  used  for  solving  problems  with  symmetric  spatial  operators  (c.g.,  for  problems 
governed  by  the  Poisson  equation).  In  Liou  and  Tezduyar*  *,  tltc  CEBE  preconditioning  was  employed,  in  conjunction 
with  the  GMRES  method,  to  solve  compressible  and  incompressible  flow  problems.  Applications  to  the  space-time 
finite  element  formulation  of  incompressible  flows  were  included  in  Liou  and  Tezduyar*  *.  To  facilitate  vcctorization  and 
parallel  processing,  as  it  is  done  in  the  grouped  clemcnt-by-clcmcnt  (GEBE)  method27,  the  clusters  can  be  grouped  in 
such  a  way  that  no  two  clusters  in  any  group  arc  connected.  Furthermore,  depending  on  the  number  of  elements  in  the 
cluster,  within  each  cluster,  elements  can  again  be  grouped  in  the  same  way.  Each  cluster  matrix  is  formed  by 
assembling  together  the  element  level  matrices  associated  with  the  elements  in  that  cluster.  The  number  of  elements  in 
each  cluster  can  be  viewed  as  an  optimization  parameter  that  can  be  varied  to  minimize  the  computational  cost.  In  fact, 
in  Mittal  and  Tezduyar6,  the  unsteady  incompressible  flow  computations  were  performed  by  using  a  space-time  finite 
element  formulation  with  a  nearly  optimal  cluster  size  which  was  determined  by  numerical  experimentation. 


In  these  lecture  notes  we  review  the  CC  (cluster  companion)  preconditioning  introduced  by  Tczduynr  et  alA  In 
the  construction  process  of  the  CC  preconditioned,  we  first  start  with  a  "primary"  mesh  with  different  levels  of 
clustering.  For  each  level  of  clustering  in  this  primary  mesh,  we  define  a  "companion”  mesh,  such  that  each  cluster  of 
the  primary  mesh  forms  an  element  of  the  companion  mesh.  We  then  define  a  CC  preconditioner  bused  on  each 
companion  mesh,  such  that  there  is  a  CC  preeonditioncr  associated  with  each  CUBE  preconditioner  based  on  a  certain 
level  of  clustering.  This  way.  for  each  level  of  clustering,  we  obtain  a  CC  preeonditioncr  which  we  expect  U>  have  more 
intcr-clustcr  coupling  information  then  the  associated  CUBE  preconditioner  has.  Conversely,  tho  CUBE  preconditioner 
can  h>>  expected  to  have  more  intra-cluster  coupling  information  than  the  associated  CC  preeonditioncr  has. 

The  mixed  CUBE/CC  preconditioning  introduced  by  Tezduyor  et  al.^  is  based  on  the  belief  that  the  CEBE  and 
CC  preconditioners  complement  each  other,  and  tltcrefore  when  they  are  mixed  together  they  will  result  in  better 
convergence  rates.  ,rhc  mixed  preconditioning  con  be  implemented  by  using  these  two  preconditioners  alternately  at  each 
iteration  of  the  conjugate  gradient  medtod  or  at  each  outer  iteration  of  the  GMRES  method.  Recently  Saad^~  has 
formulated  a  new  version  of  the  GMRES  algorithm  which  allows  changing  the  preeonditioncr  at  every  inner  iteration. 
In  fact,  a  GMRES  subroutine,  based  on  this  new  formulation  and  made  available  to  us  by  Saad,  is  what  we  use  to 
implement  our  mixed  preconditioning. 


II.  The  Governing  Equations  of  Unsteady  Incompressible  Flows 


Let  fli  <=  Rnsc*  be  the  spatial  domain  at  time  t  c  (0,1),  where  nsti  is  the  number  of  space  dimensions.  Let  I") 
denote  the  boundary'  of  fit-  We  consider  the  following  velocity -pressure  formulation  of  the  Navier-Stokes  equations 
governing  unsteady  incompressible  flows: 

p(|jL+u- V  U)~  V  <r  =0  on  flt  Vte  (0.T) .  (1) 

V  •  u  =  0  on  Ot  Vte  (0.1)  ,  (2) 

where  p  and  u  are  the  density  ami  velocity,  and  a  is  the  stress  tensor  given  as 


o(p,  u)  =  -pl  +  2ge(u) 
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Here  p  and  p  are  the  pressure,  and  the  dynamic  viscosity,  and  I  is  the  identity  tensor.  The  part  of  the  boundary  at  which 
tire  velocity  is  assumed  to  be  specified  is  denoted  by  (l'i)g  : 

u  =  *  on  (TOg  Vte- (0.T).  (5) 

The  "natural"  boundary  conditions  associated  with  (I)  are  die  conditions  on  the  stress  components,  and  these  arc  the 
conditions  assumed  to  be  imjroscd  m  the  remaining  part  of  rite  boundary: 

n  <r  =  h  on  (r \)h  V  t  e  (O.T) .  (6) 

The  homogeneous  version  of  (6),  which  corresponds  to  the  "traction-free"  (i.c.,  zero  normal  and  shear  stress)  conditions, 
is  often  imposed  at  the  out  How  boundaries.  As  initial  condition,  a  divergence-free  velocity  field  u()(x)  is  specified  over 
the  domain  fit  at  t  =0: 

u(x,0) -- uo(x)  on  Qq.  (7) 

Let  us  now  consider  two  immiscible  fluids,  A  and  B,  occupying  the  domain  Qt.  Let  (fit) A  denote  the  subdomain 
occupied  by  fluid  A,  and  (Ti)a  denote  the  boundary  of  this  subdomain.  Similarly,  let  (flt)B  attd  (r t)B  be  the 
subdomain  and  boundary  associated  with  fluid  B,  Furthermore,  let  (Pt)AB  be  the  intcrscctio"  of  (l't)A  and  (Pt)B,  i.c., 
rite  interface  between  fluids  A  and  B, 


The  kincmatica!  conditions  at  the  interface  (r\)AB  are  based  on  the  continuity  of  the  velocity  field.  The  dynamical 
xmditions  at  the  interface,  for  two-dimensional  problems,  can  be  expressed  by  the  following  equation: 

«A-  <*A  +  nB'  <*B  =  “A  y  /  Ra  cn  (r,)AB  V  t  e  (0,T)  .  (8) 

where  da  and  03  are  the  unit  outward  normal  sectors  at  the  interface,  oa  and  03  are  the  stress  tensors,  y  is  the  surface 
tension  coefficient,  and  Ra  is  the  radius  of  curvature  defined  to  be  positive  when  da  points  towards  the  center  of 
curvature.  The  condition  (8)  is  applicable  also  to  free-surface  flows  (i.e.,  when  the  second  fluid  docs  not  exist),  provided 
that  subdomain  (&t)A  *s  the  one  assigned  to  be  occupied  by  the  fluid. 

III.  The  Space-Time  Formulation  with  the  Galerkin/Least-squares  Stabilization 
and  Application  to  Moving  Boundaries  and  Interfaces:  the  DSD/ST  Procedure 

A .  The  method 

In  the  space-time  finite  clement  formulation,  the  time  interval  (Q.'r)  is  partitioned  into  subintervals  In  =  (tn.tn+l). 
where  tn  and  tn+ 1  belong  to  an  ordered  scries  of  time  levels  0  =  to  <  li  <  ...  <  in  =  T.  It  was  first  shown  in 
Te/.duyar  et  al.^3  that  the  stabilized  space-time  finite  element  formulation  can  be  effectively  applied  to  fluid  dynamics 
computations  involving  moving  boundaries  and  interfaces.  In  this  formulation  the  spatial  domains  at  various  time 
levels  are  allowed  to  vary.  We  let  Qn  =  Utn  and  Fn  =  I'tn,  and  define  the  space-time  slab  Qn  as  the  space-time  domain 
enclosed  by  the  surfaces  Qn,  £ln+l  and  Pn  (see  Figure  1).  Here  Pn,  the  lateral  surface  of  Qn,  is  the  surface  described  by 
the  boundary  T,  as  t  traverses  In .  Similar  to  the  way  it  was  represented  by  equations  (5)  and  (6),  Pn  is  decomposed  into 
(T'n)^  and  (Pn)ft  with  respect  to  the  type  of  boundary  condition  bcine  imposed. 

in+i 
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Figure  1.  The  space-time  slab  for  the  DSD/ST  formulation 

(» 

Finite  element  discretization  of  a  space-time  s'ab  Qn  is  achieved  by  dividing  it  into  elements  Qn,  c=l,2 . (nci)n. 

where  (nd)n  is  the  number  of  elements  in  the  space-time  slab  Qn .  Associated  with  this  discretization,  for  each 
space-time  slab  wc  define  the  following  finite  element  interpolation  function  spaces  for  the  velocity  and  pressure  : 


(Sj.'n  -  {  uh  1  ufl  €  [  (Qn))  uh  =  gh  on  (Pn)g  )  . 

(9) 

(Vy)n  -  (  wh  i  wh  €  [  Hlh  (Qn»  n«l,  wh  =  0  on  <Pn)g  )  . 

(10) 

(Sj)n=(v5)n=(q»>lqhe  Hlh(Qn))  . 

(ID 

Here  fll*1  (Qn)  represents  the  finite-dimensional  function  space  over  the.  space-time  slab  Qn-  This  space  is  formed  by 
using,  over  the  parent  (element)  domains,  first-order  polynomials  in  space  and  time.  It  is  also  possible  to  use 
zeroth-ordcr  polynomials  in  time.  In  either  case,  globally,  the  interpolation  functions  arc  continuous  in  space  but 
discontinuous  in  lime. 
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The  space-time  formulation  of  (l)-(8)  can  be  written  as  follows:  start  with 

(uh)q  =  (uo)h  ; 


(12) 


_  h  h  h 

sequentially  for  Qi,Q2.  •••.QN-1»  g‘v«n  (u*1'^ ,  find  u^  6  (Su)n  and  p^e  (Sp)n  ,  such  that  V  wh  e  (Vu)n 

and  V  qh  e  (V^)n 
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p  +  uh- V  uh  ^)- V  afp^.u^jJ  dQ  =  0  , 

where  (Pn)AB  is  the  space-time  surface  described  by  the  boundary  (H)ab  as 1  traverses  the  lime  interval  Ondn+l) 
In  the  \ariational  formulation  given  by  (13),  the  following  notation  is  being  used: 

(uhr  -  lim  $->o  uh(tn  ±  $) . 


(13) 


|  (...)  dQ  =  j  {(.,.>  dQ  dt  , 
On  . 

*n  *■» 


j(...)dP=  |  /(.  )  dr  dt  . 
l*n  j  p 

*n  1 


(14) 


(15) 


(16) 


Remarks 

1 .  If  we  were  in  a  standard  finite  element  formulation,  rather  tha*:  a  space-time  one,  the  Galcrkin  formulation  of  (1)- 
(8)  would  have  consisted  of  the  first  five  integral.!  (their  spatial  versions  of  course)  appearing  in  equation  (13).  In 
the  space-time  formulation,  because  the  interpolation  functions  are  discontinuous  in  time,  the  sixth  integral  in 
equation  (13)  enforces,  weakly,  the  continuity  of  the  velocity  in  lime.  The  renaming  scries  of  integrals  in 
equation  '  nrc  die  least-squares  terms  added  to  the  Galcrkin  variational  formulation  to  assure  the  numerical 

labile  ■  ‘  ‘computations.  The  coefficient  t  determines  the  weight  of  such  added  terms. 

2.  This  kind  e  utilization  of  the  Galcrkin  formulation  is  referred  to  as  the  GalcrkMeasl-squarcs  (GLS)  procedure, 
and  can  be  considered  as  a  generalization  o'  the  stabilization  based  on  the  strcamlinc-upwind/Pe^ov-Galcrkin 


i  7 


(SUFG)  procedure  employed  for  incompressible  flows.  It  is  with  such  stabilization  procedures  that  it  is  possible 
to  use  elements  which  have  equal-order  interpolation  functions  for  velocity  and  pressure,  and  which  are  otherwise 
unstable. 

3.  It  is  important  to  realize  that  the  stabilizing  terms  added  involve  the  momentum  equation  as  a  factor.  Therefore, 
despite  these  additional  terms,  an  exact  solution  is  still  admissible  to  the  variational  formulation  given  by 
equation  (13). 

The  coefficient  x  used  in  this  formulation  is  obtained  by  a  simple  multi-dimensional  generalization  of  the  optimal 

t  given  in  Shakib1®  for  one-dimensional  space-time  formulation.  The  expression  for  the  t  used  in  this  formulation  is 

(.7) 

where  v  is  the  kinematic  viscosity,  and  At  and  h  are  the  temporal  and  spatial  "clement  lengths". 

Remarks 

4.  Because  the  finite  element  interpolation  functions  are  discontinuous  in  time,  the  fully  discrete  equations  can  be 
solved  one  space-time  slab  at  a  time.  Still,  the  memory  needed  for  the  global  matrices  involved  in  this  method  is 
quite  substantial.  For  example,  in  two  dimensions,  the  memory  needed  for  space-time  formulation  (with 
interpolation  functions  which  arc  piecewise  linear  in  time)  of  a  problem  is  approximately  four  times  more 
compared  to  using  the  finite  element  method  only  for  spatial  discretisation.  However,  iteration  methods  can  be 
employed  to  substantially  reduce  the  cost  involved  in  solving  the  linear  equation  systems  arising  from  th;  space- 
time  finite  clement  discretization. 

5.  The  kincmatical  conditions  at  the  interface  (rt)AB  arc  automatically  satisfied  because  the  discretized  subdomains 
(iit)A  and  (UOb  share  the  same  nodes  at  this  interface. 

6.  The  additional  term  (i.e.,  the  fourth  integral)  in  equation  (13)  enforces  the  dynamical  conditions  associated  with 
the  interfaces  and  frcc-surfaccs  in  the  presence  of  surface  tension  effects.  If  the  interface  is  to  be  interpreted  as  the 
frec-surfacc  of  a  single  fluid,  then  the  fluid  is  assumed  to  occupy  subdomain  (&t)A  ■  This  variational 
formulation  can  of  course  be  easily  extended  to  more  than  two  fluids. 

7.  For  two-liquid  flews,  the  solution  and  variational  function  spaces  for  pressure  should  include  the  functions  which 
are  discontinuous  across  the  interface. 

B.  Application  to  flows  involving  moving  bodies 

As  a  special  case  of  moving  bodies  let  us  now  consider  a  freely  moving  cylinder.  The  cylinder  moves  with 

unknown  linear  velocity  components  Vi  and  V2  and  angular  velocity  6.  The  temporal  evolutions  of  these  additional 

unknowns  depend  on  the  flow  field  and  can  be  described  by  writing  the  Newton's  law  for  the  cylinder: 


dvj  >(Vi.V2.e.u) 
dt  m 

(18) 

dV2  MV1.V2.8.  U) 
dt  m 

(19) 

d8  T  (Vi,  V2.e,tl) 
dt  ~  J 

(20) 

where  D,  L,  and  T  are  the  drag,  lift  and  torque  on  the  cylinder,  while  m  and  J  arc  its  mass  and  polar  moment  of  inertia. 
The  vector  of  nodal  values  of  velocity  and  pressure  is  denoted  by  U.  Temporal  discretization  of  equations  (18)-(20)  leads 
to  a  set  of  equations  which,  in  an  abstract  form,  can  be  written  as 


>8 


V  -  V  =  At  D  (V  .  V,  U) 


(21) 


Here  V  (unknown)  and  V  (known)  are  vectors  representing  the  motion  of  the  cylinder,  respectively,  inside  the  current 
space-time  slab  and  at  the  end  of  the  previous  one.  The  current  slab  thickness  tn+i  -  tn  is  AL  For  linear-in-time 
interpolation,  equation  (21)  takes  the  form 
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(22) 


Based  on  the  general  expression  (21),  we  can  write  the  incremental  form  of  (22)  as 

-  At(|®)AU  -8  (l-At(|^))AV  =  RV  (U,V).  (23) 

Equation  (23)  is  of  course  coupled  with  the  incremental  form  of  the  discrete  equation  system  resulting  from  (13): 


(Myy)  AU(,)  +  (M  y y)  AV  =  Ru  (U.V) 


(24) 


In  computations  reported  in  this  article,  the  system  (23)-(24)  is  solved  by  a  block  Iteration  scheme  in  which  the  term 


(£)  is  neglected.  During 


each  iteration,  equation  (24)  is  solved  for  AD  only,  using  the  value  of  V  from  the 


previous  iteration;  and  then  V  is  updated  by  (23)  while  U  is  held  constant.  However,  the  full  system  can,  in  principle, 
be  solved  simultaneously  to  take  advantage  of  larger  time  steps  afforded  by  a  fully  implicit  method.  Iterating  on  the 
solution  will  still  be  needed  not  only  because  of  the  nonlinear  nature  of  (1),  but  also  because  of  the  dependence  of  the 

c 

element  domaias  Qn  on  the  vector  V. 


Remark 


8.  In  the  DSD/ST  procedure  to  facilitate  the  motion  of  frcc-surfaccs,  interfaces  and  solid  boundaries,  we  need  to 
move  the  boundary  nodes  with  the  normal  component  of  the  velocity  at  those  nodes.  Except  for  this  restriction, 
wc  have  the  freedom  to  move  all  the  nodes  any  way  wc  would  like  to.  With  this  freedom,  we  can  move  the  mesh 
in  such  a  way  that  we  only  need  to  rcmes!  vhen  it  becomes  necessary  to  do  so  to  prevent  unacceptable  degrees  of 
mesh  distortion  and  potential  entanglements.  By  minimizing  the  frequency  of  remeshing  we  minimize  the 
projection  errors  expected  to  be  introduced  by  remeshing.  In  fact,  for  some  computations,  as  a  byproduct  of 
moving  the  mesh,  wc  may  be  able  to  get  a  limited  degree  of  automatic  mesh  refinement,  again  with  minimal 
projection  errors.  For  example,  a  mesh  moving  scheme  suitable  for  a  single  cylinder  drifting  in  a  bounded  flow 
domain  is  described  in  Tc/.duyar  ct  alA  Also  by  minimizing  the  frequency  of  retrvshing,  we  increase  the  massive 
parallelization  potential  of  the  computations. 
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IV.  The  Formulations  with  the  SUPG  and  PSPG  Stabilizations 

The  space-time  formulation,  described  in  the  previous  section,  has  the  advantage  of  being  able  to  handle  flow 
problems  involving  moving  boundaries  and  interfaces,  but  is  quite  cosily  for  large-scale  problems.  The  CEBE  iteration 
technique  can  be  used  to  reduce  the  cost  substantially.  For  problems  that  do  not  involve  any  moving  boundaries  and 
interfaces,  one  can  use  even  less  costly  formulations.  These  formulations  are  based  on  finite  element  discretization  in 
space  only,  rather  than  in  both  space  and  time.  In  this  section  the  variational  formulations  with  the  SUPG  and  PSPG 
stabilization  terms  arc  described. 

Associated  with  the  finite  element  discretization  of  ft,  we  define  the  following  finite  element  interpolation  function 
spaces  for  the  velocity  and  pressure : 


sjj  =  {  uh  1  uh  e  [  Hlh  (ft)]  nsd,  uh  i  gh  on  rF  }  , 

(25) 

vjj  =  {  wh  1  wh  €  [  Hlh  (ft)]  "sd,  wh  =  0  on  rg  }  , 

(26) 

Sp=vJ=  {qh|qh€  (ft)}. 

(27) 

Here  Hlh  (ft)  represents  die  finite-dimensional  function  space  over  the  spatial  domain  ft.  This  space  is  formed  by 
using,  over  the  element  domains,  first-order  polynomials  in  space.  The  stabilized  Galerkin  formulation  of  (l)-(7)  can 

be  written  as  follows:  find  uh  e  sj|  and  ph  e  ^  ,  such  that  V  w*1  e  and  Vq*>6 


I  wh'P  f^T  +  “h  V  u*0  dft  +  [  e(w^) :  ofp^.u*1)  dft  -  fwh-Adr 

ft  V ®l  J  Si 


+  |  qh  p  Vuh  dft 


ft 


+  I  J  (Sh  +  *h)  - 


6=1  fte 


uh-V 


V  o(ph,uh)j 


dft  = 


0 


(28) 


As  it  can  be  seen  front  equation  (28),  two  stabilizing  terms  have  been  added  to  the  standard  Galerkin  formulation  of  (1)- 
(7);  the  one  with  8^  is  the  SUPG  term,  and  the  one  with  e*1  is  the  PSPG  (prcssurc-sta*'ilizirig/Pcirov-Galerkin)  term. 
For  definitions  of  the  Pctrov-Galcrkin  functions  5h  and  th  sec  Tezduyar  et  alA 

The  spatial  discretization  of  equation  (28)  leads  to  the  following  set  of  non-linear  ordinary  differential  equations: 

(M  +  M$)  a  +  N(v)  +  N5(v)  +  (K  +  K§)  v  -  (G  +  G§)  p  =  F  +  F$  ,  (29) 

GTv  +  Mea+  Ne(v)+  Ke  v+  Ge  p  =  E  +  Ee  ,  (30) 

where  v  is  the  vector  of  unknown  nodal  values  of  uh,  a  is  the  time  derivative  of  v,  and  p  is  the  vector  of  nodal  values 
of  ph.  The  matrices  M,  N,  K  and  G  arc  derived,  respectively,  from  the  umc-dcpcndcnt,  advective,  viscous,  and  pressure 
terms.  The  vector  F  is  due  to  the  boundary  conditions  (5)  and  (6)  (i.e.,  the  g  and  h  terms),  whereas  the  vector  E  is 
due  to  the  boundary  condition  (S).  The  subscripts  5  and  e  identify  the  SUPG  and  PSPG  contributions,  respectively. 
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Let  us  consider  the  time-integration  of  equations  (29)  and  (30)  by  a  one-step  generalized  trapezoidal  rule;  i.e., 
given  (uh^,  find  (uh^+1  and  (ph^.+1  (this  will  be  referred  to  asTl  formulation).  When  written  in  an  incremental 


form,  the  T1  formulation  leads  to 

M*  Aa-G*  Ap  =  R  ,  (31) 

(GT)*  Aa  +  Ge  Ap  =  Q  ,  (32) 

where 

R  =  F  +  Fg  -  [(M  +  Mg)  a  +  N(v)  +  Ng(v) 

+  (K  +  Kg)  v  -  (G  +  Gg)  p]  ,  (33) 

Q  =  E  +  Ee  -  [GT  v  +  Me  a  +  Ne(v)  +  Ke  v  +  Ge  Pi  .  (34) 

M*  »  M  +  Mg  +  a  At  (|~+  +  K  +  Kg  )  .  (35) 

G*  =  G  +  Gg  .  (36) 

(G1)*  =  Me  +  a  At  ( Ke+  GT  )  .  (37) 


The  parameter  a  controls  the  stability  and  accuracy  of  the  time  integration  algorithm. 
Remarks 


9.  The  equation  systems  (31)  and  (32)  can  be  solved  by  treating  the  velocity  explicitly  in  the  momentum  equation. 
Since  the  SUPG  and  PSPG  supplements  arc  applied  to  all  terms  in  the  momentum  equation,  in  explicit 
computations  the  coefficient  matrix  of  the  pressure  equation  is  generally  not  symmetric.  All  explicit  T1 
computations  reported  in  this  paper  are  based  on  the  symmelrization  of  the  coefficient  matrix  of  the  pressure 

equation,  and  the  results  arc  obtained  with  2  passes  per  lime  step.  In  such  computations  M  ,  G  and  (G  )  are 


replaced  with 

M*  =  ML  . 

(38) 

G*  =  G  , 

(39) 

(GT)*  =  o  At  GT  , 

(40) 

where  is  the  lumped  version  of  the  mass  matrix  M  . 

10.  One  can  also  write  a  multi-step  (T6)  time  integration  formulation  for  equations  (l)-(7).  In  the  T6  formulation 
the  SUPG  term  is  applied  only  to  the  sub-steps  involving  the  odvectivc  terms.  The  PSPG  term,  on  the  other 
hand,  is  applied  only  to  the  sub-steps  involving  the  pressure.  For  details  of  the  T6  formulation  and  its 
performance  see  Tezduyar  ct  al.  * 
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V.  CEDE  (Clustered  Element-by-Element)  Preconditioning 
Consider  a  linear  equation  system 

A  x  =  b  (41) 

encountered  in  finite  element  computation  of  a  problem.  Based  on  the  finite  element  discretization  of  the  problem 
domain  SI,  the  matrices  A  and  b  arc  formed  by  adding  together  their  element  level  constituents;  i.e., 

nel 


A 


"cl 

b  =  Xbc 

_  .  * 


(42) 


(43) 


where  nd  is  the  number  of  elements. 

Remarks 

1 1.  The  domain  &  can  also  be  a  space- time  domain,  in  which  case  the  elements  arc  space-time  elements. 

12.  The  element  level  matrices  Ae  and  be  have  die  same  dimensions  as  the  global  matrices  A  and  b,  respectively; 
i.c.,  ncq  x  nCq  and  nCq  x  1 ,  where  nCq  is  the  number  equations.  However,  the  only  non-zero  entries  for  these 
clement  level  matrices  arc  those  corresponding  to  the  nodes  of  element  c,  and  this  fact  is  taken  into  account  in  the 
implementation. 


Wc  assume  that  direct  solution  of  (41)  is  not  computationally  feasible  and  dial  we  would  like  to  design  a  good 
preconditioner  to  maximize  the  efficiency  of  the  iterative  solution  procedure.  To  achieve  this,  first  wc  rewrite  (41)  in  a 
sealed  form 


A  k  =  b  , 


where 


A  =W~1/2  A  W  1/2 


;  =  W~1'2b, 


;  =w*'2 


The  scaling  matrix  W  is  defined  as 


(44) 


(45) 

(46) 

(47) 


W  =  diag  A  . 


(48) 


Remarks 


13.  This  definition  for  the  scaling  matrix  W  is  a  good  one  when  the  matrix  A  is  positive-definite.  However,  when  A 
is  not  positive-definite,  the  following  alternative  definition^  can  be  used: 

W  =  tump  M  (49) 

where  lump  M  is  the  lumped  version  of  the  mass  matrix  M.  It  is  perhaps  reasonable  to  look  into  defining  a 
scaling  matrix  based  on  a  combination  of  (48)  and  (49). 

14.  In  scaling  a  matrix,  no  matter  what  the  level  of  that  matrix  is,  the  global  seating  matrix  W  is  the  same.  For 
example,  the  element  level  matrices  Ac  and  bc  are  sealed  as 


Ac  =W  1/2  AC  W~1/2  , 

(50) 

bc  =W  -1^2  bC  . 

(51) 

The  matrix  A  can  bc  expressed  as 

ncl 

A  =  W  >  X  (Ac  -  Wc) 

0=1 

(52) 

In  the  sealed  form  this  expression  becomes 

«cl  „ 

A  =  I  +  X  B  c  , 

C=1 

(53) 

where 


B°=AC-WC  ,  e  =  1,  2 . net.  (54) 

Theclcmcnt-by-clcmcnt  (EBE)  preconditioning  is  based  on  the  approximation  of  (53)  by  a  sequential  product  of 
element  level  matrices.  Earlier  implementations  can  bc  seen  in  Hughes  ct  al.23-24  various  vectorized  versions  and 
applications  to  three-dimensional  problems  can  bc  found  in  Hughes  and  Fcrcnc/,25.  Parallel  implementation  of  the 
method  is  achieved  in  Tezduyar  cl  al.^  based  on  the  grouped  clcmcnt-by-clcment  (GEBE)  approach^?,  in  which 
elements  arc  ordered  in  groups  with  no  inter-element  coupling  within  each  group.  The  number  of  groups  Is  minimized 
to  minimize  the  overhead  associated  with  synchronization  in  parallel  computations.  Applications  in  conjunction  with 
the  implicit-explicit  and  adaptive  implicit-explicit  element  grouping  can  bc  seen  in  Tezduyar  and  Liou^,  Tezduyar  ct 
al.28,  Shakib  ct  al.29  and  Liou  and  Tezduyar^.  Depending  on  the  form  of  matrix  A,  the  EBE  type  preconditioned  can 
bc  used  with  the  conjugate  gradient,  GMRES  *  or  some  other  sophisticated  search  algorithm. 
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Figure  2.  Four  different  levels  of  clustering  for  a  uniform  16  x  16  mesh;  in  each  frame  the  thick  lines  depict  the 
cluster  boundaries  and  the  associated  companion  mesh. 


In  the  CEBE  (clustered  clcmcnt-by-clcmcnt)  method  the  set  of  elements  e  is  partitioned  into  clusters  of  elements 
ej,  J  =  1,2, Nci.  For  example,  Figure  2  shows  four  different  levels  of  clustering  for  a  uniform  16  x  16  mesh.  The 
cluster  boundaries  are  marked  with  thick  lines.  In  the  first  frame  each  cluster  consists  of  one  clement,  and  therefore  this 
would  lead  to  an  F.BE  method.  In  the  last  frame  the  cluster  size  is  8  x  8;  the  next  level  of  clustering  after  that  (i.e.,  level 
S)  would  lead  to  a  direct  solution  method.  The  global  matrix  \j  associated  with  cluster  J  is  defined  as 


(55) 


V|  4 


Aj  =  E  AC  . 
e  <=ej 

The  matrix  A  can  then  be  expressed,  sitnilai  to  (52),  as 


Net 

A  *  w  +  X(aj"Wj)  •  (56) 

J=1 

In  the  sealed  fonn  this  expre.  becomes 

Nd. 

A  =1+  S  Bj  , 

J=1 

where 

Bj  =  Aj  -  Wj  , 


J=  1.2 . Nc| . 


(57) 

(58) 


The  CEBE  preconditioning  is  based  on  the  approximation  of  (S7)  by  a  sequential  product  of  cluster  level 
matrices.  Here  we  give  two  examples  (see  Liou  and  Tezduyar*  *  >3*);  2-Pass  CEBE  preconditioner  and  Crout  CEBE 
preconditioner.  The  2-Pass  CEBE  pneconditioncr  is  defined  as 

Nc,  1 

PC  =  n(It-^Bj)  n  (I  +  ~  Bj)  ,  (59) 

J-l  J-Nc, 

and  tlic  Crout  CEBE  preconditioner  is  defined  as 
„  Nc|  A  Ncj  A  1  A 

»*c  -  n.Lj  n  o,  n  u,  ,  «*» 

J  =  1  J=  I  J-Nc| 

A  A  A 

where  Lj  ,  D  j  and  Uj  are  the  matrices  resulting  from  the  following  Crout  factorization: 

~  AAA 

I  +  Bj  =  LjDjUj  ,  J  ~  1,2 . Ncl.  (61) 

In  Liou  and  Tezduyar^,  these  types  of  preconditioning  were  used,  in  conjunction  with  the  conjugate  gradient  method, 
for  problems  governed  by  the  Poisson  equation.  In  Liou  and  Tezduyar*  *  they  were  used,  together  with  the  GMRES 
method,  for  compressible  and  incompressible  (low  problems.  Of  course  the  convergence  rates  depend  on  the  cluster 
sizes.  In  Mittal  and  Tezduyai®,  for  the  space-time  finite  element  formulation  of  an  incompressible  flow  problem,  an 
optimal  cluster  size  was  determined  by  numerical  experimentation  and  was  used  in  the  computations. 


VI.  CC  (Cluster  Companion)  Preconditioning 

Let  us  consider  a  mesh  with  different  levels  of  clustering.  For  each  level  of  clustering  in  this  "primary"  mesh,  we 
define  a  "companion"  mesh,  such  that  each  cluster  of  the  primary  mesh  forms  an  element  of  the  companion  mesh.  For 
example,  Figure  2  can  now  also  be  seen  as  showing  the  companion  meshes  associated  with  four  different  levels  of 
clustering  in  a  16  x  16  primary  mesh.  In  each  frame  of  Figure  2,  the  thick  lines  not  only  mark  the  cluster  boundaries 
for  a  certain  level  of  clustering,  but  also  depict  the  companion  mesh  associated  with  that  level  of  clustering.  In  the  first 
frame  the  companion  mesh  is  the  same  as  the  primary  mesh.  In  the  last  frame  the  companion  mesh  is  a  2  x  2  mesh.  In 
our  notation,  the  level  of  clustering  and  the  associated  companion  mesh  will  be  identified  by  the  same  integer  number, 
i.c.,  companion  mesh  I  will  be  associated  with  clustering  level  I. 

Because  the  companion  mesh  1  is  the  same  as  the  primary  mesh,  equations  (4 1)-(43)  can  also  be  written  as 


1  1  1 

(A)  (x)  =  (b) 


(62) 


.1- 1 5 


(A) 


1 


("cl)1  , 

I  <AC) 
c=l 


(63) 


1 

(b) 


("cl)1  , 

I  (bc) 


c=l 

where  the  superscript  "1"  denotes  the  companion  mesh  number. 


(64) 


1 

let  (u)  bc  the  approximation  of  a  displaccmcnt-like  scalar  field  u  over  the  companion  mesh  1,  such  that 


1  (>)1  j  i 

(“)  "  Z  (N)b(»)b  t  (65) 

1  I 

where  (nnp)  is  the  number  of  nodal  points  in  companion  mesh  l,  (N)q  is  the  shape  function  associated  with  node  B, 
1  1 

and  (u)q  is  the  value  of  (u)  at  node  B.  A  similar  expression  can  be  written  to  approximate  u  over  the  companion  ntesh 

2: 


2  ("np>2  2  2 

(u)  a  £  (N)q  (u)rj  (66) 

B=1  -  KV 

11  1  2  2  2  2 

Let  (u)  =  {  (u)b,  B  =  1, 2 .  (nnp)  )  and  (w)  =  { (u)B  ,  B  =  1,  2 .  (nnp)  ).  Given  (u)  ,  we  would  like  to 

1  1  2 
obtain  an  expression  that  approximates  (ii)  Based  on  least-squares  minimization  of  the  difference  (u)  -  (u)  ,  we 
obtain 


11  1  12  2 

M  (u)  =  M  (u)  , 


where 

11  f  1  1  1 

(M  )AB  “  J  (N)A  (N)r  dft  .  A,  B  =  1,2 .  (n„p)  , 

ft 


12  f  1  2  1  2 

(M  )AB  -  J  (N)a  (N)b  .  A  =  1,  2 .  (nnp)  ,  B  =  1,  2 .  (nnp)  . 

ft 


From  (67),  we  can  write 


(67) 

(68) 


(69) 


1  12  2 
(u)  s  E  (u) 


(70) 


VI 6 


with  the  "interpolation”  matrix  E  denned  as 


12  II  -t  12 
K  =  PM  ]  M 


12  11-1  12 
E  =  [tump  M  ]  M 


An  expression  similar  to  (70)  can  be  written  to  obtain 


2 

21  1 

(u) 

=  E  (u) 

with 

21 

22-1  21 

E 

■  [M  ]  M 

or 

21 

22  -t 

E 

=  [lump  M  1 

when?. 


(71) 

(72) 

2  1 
(u)  front  (u)  : 

(73) 

(74) 

(75) 


22  f  2  2  2 

<M  )AB  =  J(N)A(N)B«lft  .  A,  B  =  I,  2 .  (nnp)  , 

U 


(M21)AB  =  J  (N)a  (N)b  dO) ,  A  =  1 , 2 .  (nRp)2.  B  -  1, 2, .  (nnp)1 

ft 


(76) 


(77) 


12  21 

More  on  Ute  derivations  related  to  E  and  E  can  be  found  in  the  Appendix.  Assuming  that  the  Dirichlet  type 
Itoumlary  conditions  arc  somehow  taken  care  of  in  the  implementation,  we  can  also  use  equations  (70)  and  (73)  to  obtain 
1  2 

(x)  and  (x)  from  each  other.  That  is. 


1 

12  2 

(x) 

s 

E  (x) 

♦ 

(78) 

*> 

21  1 

(X) 

s 

K  (x) 

(79) 

I  2 

Furthermore,  by  assuming  that  (b)  and  (b)  are  force-like  quantities  and  that  the  energy-like  quantities  expressed  over 

112  2 

the  two  companion  meshes,  i.c„  (b)  (x)  and  (b)  (x)  ,  arc  equivalent,  we  can  write 


2 

21  1 

(b) 

s  F  (b) 

* 

(80) 

1 

12  2 

(b) 

a  F  (b) 

♦ 

(81) 

.'•P 


21  12  1 
V  =  (E  ) 


12  21 T 

F  =  (E  ) 


From  (62),  (78)  ami  (80)  wc.  can  write  an  approximate  expression  for  ((A)  ] 


1-1  12  2  -1  21 

((A)  ]  s  E  [(A)  1  V  (84) 

This  expression  is  the  starting  (joint  for  us  to  construct  a  companion  prccontlilioner  based  on  the  companion  mesh  2. 
2 

The  matrix  (A)  can  be  computed  either  by  using  the  definition  of  A  over  the  companion  mesh  2,  or  by  using  the 
following  expression: 


2  21  l  12 

(A)  =  F  (A)  B 

Wc  note,  for  implcmcntationa!  purposes,  that  (85)  is  equivalent  to 


2  (llc,)l  21  1  12 
(A)  s  I  F  (Ac)  E 


l  2 

Wc  also  note  that,  if  (A)  is  symmetric  and  positive-definite,  so  is  (A)  given  by  the  expression  (85).  However,  wc 

1  -1 

cannot  say  the  same  thing  for  1(A)  |  given  by  the  expression  (84).  Therefore,  to  define  our  companion 
precoiiuiuOiicr,  wc  use  a  regularization  similar  to  the  one  used  in  (52).  The  cluster  companion  preconditioner  based  on 
companion  mesh  2  is  then  defined  as 
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In  sealed  fonn.  (87)  can  be'  rewritten  as  follows: 
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We  also  experimented  with  the  following  modified  version  of  (87): 
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which  can  be  written  in  seated  fonn  as 
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Wo.  can  rcjx'at  the  expressions  given  by  (88)  and  (90)  for  the  cluster  companion  preconditioner  based  on  lire  companion 
mesh  3: 
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Mere  K  anti  B  can  be  computed  either  by  using  definitions  similar  to  those  given  by  equations  (7  !)•  (72)  and  (74)- 
(75),  or  by  using  the  following  reunions: 
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In  any  ease,  the  matrices  F  and  F  are  defined  as 
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Remark 

15,  It  is  quite  clear  that  the  philosophy  behind  this  type  of  preconditioning  is  similar  to  the  philosophy  behind 
nudtigrid  iteration  methods. 

16.  One  could  also  incorporate  the  idea  of  "companion"  meshes  in  conjunction  with  formulations  employing  higher- 
order  elements.  For  example,  for  a  mesh  using  bi-quadratic  elements  only  the  nodes  at  tlie  corners  of  the  higher- 
order  elements  will  form  the  companion  mesh  at  level  2.  To  go  to  level  3  one  could  cluster  elements  in  the 
mesh  at  level  2  and  so  on. 


VII.  Mixed  CEBE/CC  Preconditioning 

It  is  reasonable  to  expect  that  the  CEBE  preconditioner  has  more  intra-cluster  coupling  information  than  the  C'C 
preconditioner  has.  It  is  also  reasonable  to  expect  that  the  CC  preconditioner  has  more  intcr-cluslcr  coupling 
information  than  die  CEDE  preconditioner  has.  Therefore,  because  these  two  preconditioners  in  a  sense  complement 
each  other,  it  is  reasonable  to  hope  dial  when  dicy  are  mixed  together  they  lead  to  better  convergence  rates. 

Initially  our  plan  was  to  use  these  two  prcconditioncrs  alternately  at  each  iteration  of  die  conjugate  gradient 
method  or  at  each  outer  iteration  of  the  GMRES  method.  However,  it  was  recently  brought  to  our  attention  that  $aad~  ’ 
has  formulated  a  new  version  of  the  GMRES  algorithm  which  allows  one  to  change  the  prcoonditioner  C  every  inner 
•'  iteration.  A  GMRES  subroutine  based  on  this  new  formulation  was  made  available  to  us  by  Saad,  and  we  simply  use 
this  subroutine  to  implement  our  mixed  preconditioning. 

In  our  notation,  CERE-1  will  represent  the  CEDE  preconditioning  based  on  clusvcring  levei  I,  CC-I  will 
represent  the  CC  preconditioning  based  on  companion  mesh  I  associated  with  clustering  level !.  ai.d  CERE/CC-I  will 
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represent  the  mixing  of  the  two.  For  example,  CC-1  would  lead  to  a  direct  solution  method,  and  therefore  we  will 
normally  start  our  test  computations  with  I  =  2  or  higher. 

Remark 

17.  We  note  that  as  I  increases,  the  cost  associated  with  CEBE-I  increases  and  the  cost  associated  with  CC-I 
ttof^ases. 


VIII.  Numerical  Examples  for  Unsteady  Incompressible  Flows 

All  solutions  using  the  space-time  formulation,  presented  here,  were  obtained  with  lincar-;n-timc  interpolation 
functions.  For  the  details  of  the  computations  see  Tczduyar  ct  al.1*^,  Mittal  and  Tezduyar^  and  Behr  et  al7. 

A.  Unsteady  flow  past  a  cylinder  at  Reynolds  number  100 

In  this  test  problem  the  dimensions  of  the  computational  domain,  normalized  by  the  cylinder  diameter,  are  30.5  and 
16.0  in  the  flow  and  cross-flow  directions,  respectively.  The  free-stream  velocity  is  0.125.  Reynolds  number  is  based  on 
die  free-stream  velocity  and  the  diameter  of  the  cylinder.  Symmetry  conditions  are  imposed  at  the  upper  and  lower 
computational  boundaries,  and  the  traction-fi  x,  condition  is  imposed  at  the  outflow  boundary. 

To  have  a  better  basis  of  comparison  between  the  solutions  obtained  by  using  Q1Q1  and  P1P1  elements,  meshes 
generated  with  both  the  elements  arc  required  to  have  die  same  distribution  of  the  velocity  and  pressure  nodes.  The  nodal 
values  of  the  stream  function  and  vorticity  arc  obtained  by  the  least-squares  interpolation.  For  the  meshes  generated 
with  the  P1P1  element,  these  quantities  arc  computed  from  the  velocity  field  by  using  the  meshes  generated  with  the 
Q1Q1  element.  For  details  of  the  computations  and  the  performance  characteristics  sec  Tczduyar  cl  al.V 

The  mesh  used  for  Q1Q1  consists  of  5240  elements,  while  the  number  of  elements  for  P1P1  is  10,480.  Both 
meshes  contain  S350  velocity  nodes.  A  time  step  of  0.125  was  chosen  for  the  computations.  The  periodic  solution  is 
computed  by  introducing  a  short  term  perturbation  to  the  symmetric  solution.  We  have  observed,  at  least  for  sm?.il 
perturbations,  that  the  periodic  solution  is  independent  of  the  mode  of  perturbation. 

For  the  OlQl/ST  computations  wc  use  the  clustered  cicmcnt-by-elemcnt  iteration  method  to  solve  *lic  resulting 
equation  system.  At  each  time  step  about  31,500  equations  are  solved  simultaneously.  We  chose  a  Krylov  vector  space 
of  dimension  25  and  a  cluster  size  of  approximately  25  elements.  For  this  problem,  the  CEBE  technique  takes  less  then 
one-seventh  the  CPU  time  and  less  then  one-third  the  storage  needed  by  the  direct  method.  It  should  be  mentioned  at 
this  point  that  wc  have  had  the  successful  experience  of  solving  this  problem,  using  die  space-time  formulation,  with  a 
much  larger  time  step  (1.0).  Here  we  use  a  smaller  time  step  to  compare  the  solution  with  the  ones  obtained  with  other 
formulations. 

Strouhal  number  and  the  time  history  of  the  lift  and  drag  coefficients  for  the  various  formulations  ?xc  shown  in 
Figure  3.  The  Q1Q1  clement  gives  a  Strouhal  number  about  2%  higher  than  what  the  P1P1  element  gives.  Although 
the  lift  and  drag  coefficients  show  no  significant  difference  among  different  formulations,  the  Q1Q1  element  gives  a 
slightly  higher  drag  coefficient  than  the  PI  PI  element,  and  the  Q1Q1/T1  formulation  gives  a  slightly  higher  drag 
coefficient  than  the  Q1Q1/ST  formulation, 

The  periodic  solution  flow  patterns  corresponding  to  die  crest  value  of  die  lift  coefficient  arc  shown  in  Figures 
4-6.  The  patterns  corresponding  to  the  trough  value  of  die  lift  coefficient  are  simply  die  mirror  images,  with  respect  to 
the  horizontal  centerline,  of  the  patterns  shown  in  Figures  4-6.  The  solutions  obtained  with  different  formulations  are 
very  similar.  However  it  can  be  seen,  upon  close  comparison,  that  die  Q1Q1  element  is  less  dissipative  than  the  P1P1 
element  and  that  the  Q1Q1/ST  formulation  shows  less  dissipation  than  the  Q1Q1/T1  formulation.  On  comparing  these 
solutions  with  the  ones  reported  in  Tezduyar  ct  al.-^,  it  can  be  observed  that  (he  solutions  obtained  with  the  Q1Q1  and 
P1P1  elements  are  very  Jose  to  the  ones  obtained  with  the  pQ2Pl  and  Q1P0/T6.  If  wc  compare  these  solutions  with 
the  ones  reported  for  the  T6  formulation  in  Tczduyar  et  al.V  wc  observe  that  T6  formulation  is  less  dissipative  than  the 
T1  formulation  and  the  Q1Q1/S T  formulation  gives  solutions  very  similar  to  Q1Q1/T6. 

B.  Pulsating  dtop 

In  this  problem  the  drop  is  initially  of  elliptical  shape  with  axial  dimensions  1 .25  (horizontal)  and  0.80  (vertical). 
The  density,  viscosity  and  the  surface  tension  coefficient  arc  1.0, 0.001  and  0.001,  respectively.  The  effect  of  gravity  is 
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dimensions  of  the  drop.  Figures  8a,  and  8b  show  ihc  flow  field  and  finite  element  mesh  corresponding,  approximately, 
to  points  "a",  and  '  b"  in  Figure  7. 

C.  Large-amplitude  sloshing 

This  problem  is  similar  to  the  one  that  was  considered  in  Huerta  and  Liu-^.  Initially  the  fluid  is  stationary  and 
occupies  a  2.667  x  1.0  rectangular  region.  The  density  and  viscosity  are  1.0  and  0.002.  The  gravity  is  1.0,  and  the 
surface  tension  is  neglected.  The  wave  is  created  by  applying  a  horizontal  body  force  of  A  sin(cot),  where  A  =  0.01  and 
<0  =  0.978.  The  Reynolds  number  (based  on  the  height  of  the  fluid  and  the  gravity)  is  514.  Inviscid  boundary 
conditions  are  assumed  at  the  walls  of  the  "tank".  Compared  to  the  problem  considered  here,  the  Reynolds  number  used 
in  Huerta  and  Ltu^  is  514,000.  Furthermore,  in  Huerta  and  Liu^  the  horizontal  body  force  is  removed  after  ten  cycles; 
in  this  case,  on  the  other  hand,  this  force  is  maintained  during  the  entire  computation.  The  number  of  elements  is  441, 
and  the  time  step  size  is  0. 107.  With  these  values  of  the  frequency  and  the  time  step  size,  a  single  period  of  the  forcing 
function  takes  60  time  steps.  Figure  9  shows  the  time  history  of  the  vertical  location  (relative  to  the  stationary  level  of 
1.0)  of  the  free-surface  along  the  left-  and  right-  hand-sides  of  the  "tank".  Figures  10a,  and  10b  show  the  flow  field  and 
finite  element  mesh  corresponding,  approximately,  to  points  "a",  and  "b"  in  Figure  9. 

D.  Flow  past  an  oscillating  cylinder 

This  is  a  simple  but  fundamental  fluid-structure  interaction  problem.  The  computation  involves  flow  past  a 
circular  cylinder  that  is  mounted  on  flexible  supports  and  is  free  to  respond  to  the  fluid  forces  in  the  vertical  direction; 
the  Reynolds  number  tor  tliis  simulation  is  324.  The  dimensions  of  the  computational  domain,  normalized  by  the 
cylinder  radius,  are  61.0  and  32.0  in  the  flow  and  cross-flow  directions,  respectively.  The  mesh  employed  consists  of 
4060  elements  and  4209  nodes.  Symmetry  conditions  are  imposed  at  the  upper  and  lower  computational  boundaries, 
and  the  traction-free  condition  is  imposed  at  the  outflow  boundary.  The  periodic  solution  is  obtained  by  introducing  a 
short  term  perturbation  to  the  symmetric  solution.  In  these  compulations,  we  use  the  CEBE  iteration  method  to  solve 
the  resulting  equation  system.  At  each  time  step  about  25,000  equations  are  solved  simultaneously.  We  chose  a 
Krylov  vector  space  of  dimension  25  and  an  average  cluster  size  of  23  elements.  For  this  problem  the  CEBE  technique 
takes  less  then  one-sixth  the  CPU  time  and  less  then  one-third  the  storage  needed  by  the  direct  method. 

At  this  Reynolds  number  the  natural  frequency  of  the  spring  macs  system  and  the  vortex  shedding  frequency  for 
flow  past  a  fixed  cylinder  have  very  close  values.  Consequently,  the  cylinder  undergoes  high  amplitude  oscillations 
(approximately  one  cylinder  radius)  in  the  vertical  direction.  These  oscillations  alter  the  flow  field  significantly.  Figure 
Ua  shows,  for  the  initial  stages  of  the  simulation,  time  history  of  the  lift,  drag  and  torque  coefficients  and  the 
normalized  vertical  displacement  and  velocity  of  the  cylinder.  We  observe  that  the  cylinder  oscillates  with  an  increasing 
amplitude.  The  drag  and  torque  coefficients  for  the  cylinder  also  increase  while  the  lift  coefficient  shows  a  decreasing 
amplitude.  It  is  interesting  to  note  that  both  the  mean  and  peak  values  of  the  drag  coefficient  increase  with  time,  but  the 
trough  value  remains  almost  constant.  The  quantities  displayed  in  Figure  1  la  are  shown  in  Figure  1  lb  for  a  later 
strtch  cf  time  when  the  cylinder  reaches  a  steady-state  oscillation  amplitude  of  about  one  radius.  The  cylinder  oscillates 
wa>.  -*s  natural  frequency,  and  so  docs  the  torque  coefficient;  the  drag  coefficient  oscillates  with  twice  the  natural 
freii.-  s,cy  of  the  cylinder.  The  dominant  frequency  for  the  lift  coefficient  corresponds  to  the  natural  frequency  of  the 
cylindct.  In  addition,  there  is  a  very  small  component  of  the  lift  coefficient  with  thrice  the  frequency  of  the  dominant 
one.  Figure  12  shows  a  sequence  of  frames  for  the  vorticity  during  one  period  of  the  cylinder  motion.  The  first,  third 
and  last  frames  correspond  io  mean  cylinder  location,  while  flic  second  and  fourth  frames  correspond,  respectively,  to  the 
lower  and  upper  extreme  positions  of  the  cylinder.  Fo:  details  of  this  problem  and  the  computations  see  Mittal  and 
TezduyaA 

£.  Flow  past  an  oscillating  airfoil 

This  computation,  penonned  on  the  Connection  Machine  CM-5,  involves  flow  past  a  NACA  0012  airfoil  pitching 
at  Reynolds  number  1000.  The  solution  is  obtained  using  the  space-time  algorithm  with  Galerkin/lcasr -squares 
stabilization.  The  mesh  consists  of  6609  nodes  and  6460  elements.  Linear-in-lime  shape  functions  are  used.  At  each 
time  step,  approximately  39,000  equations  are  solved  simultaneously.  TIic  implicit  equation  system  is  solved  using  the 
GMRES  method  in  conjunction  with  a  diagonal  ptcconditioner.  The  steady-state  solution  for  flow  past  a  stationary 
airfoil  at  an  angle  of  attack  of  10  degrees  and  at  Reynolds  number  1000  is  used  as  initial  condition.  Then  the  airfoil  is 
forced  to  pitch  about  its  half  chord  >vnint  with  a  non-d.metisinnal  frequency  of  1.0  (fcflJ).  Figure  13  the  vorticity  field  at 
various  instants  of  the  pitching  motion.  During  each  period  of  airfoil  oscillation  two  vortices  are  shed,  one  from  the 
leading  edge  and  the  other  from  the  trailing  edge.  For  details  of  this  problem  and  the  implementation  on  the  Connection 
Machine  see  Miual  and  Tezduyar^  and  Sehr  et  al.7. 
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elements  cover  the  entire  domain  and  the  Intersection  between 
elements  occurs  only  on  common  points,  sides  or  triangular  faces  in 
the  three  dimensional  case.  The  final  grid  is  constructed  in  a  bottom- 
up  manner.  The  process  starts  by  discretising  each  boundary  curve. 
Nodes  are  placed  on  the  boundary  curve  components  anJ  then 
contiguous  nodes  are  joined  with  straight  line  segments.  In  later 
steges  of  the  generation  process,  these  seqmonls  will  become  sides 
of  some  triangles.  The  length  of  these  segmenls  must  therefore,  be 
consistent  with  the  desired  local  distribution  of  grid  size.  This 
operation  it  repeated  lor  each  boundary  curve  in  turn. 

The  next  stage  consists  ct  generating  triangular  planar  faces  For 
each  two  dimensional  region  or  surface  to  be  discrotised,  ail  the 
edges  produced  when  discretising  its  boundary  curves  ere  assembled 
into  the  so  called  initial  front.  The  relative  orientation  ol  the  curve 
components  with  respect  to  the  surface  must  be  taken  into  account  in 
order  to  give  tho  correct  orientation  to  the  sides  in  the  initial  front. 
The  front  is  a  dynamic  data  structure  which  changes  continuously 
during  the  generation  process.  At  any  given  time,  the  front  contains 
the  set  ol  all  the  sides  which  are  currently  available  to  form  a 
triangular  lece.  A  side  is  selected  trom  the  front  and  a  triangular 
element  is  generated.  This  may  involve  creating  a  new  node  or 
simply  connecting  to  an  existing  one.  Alter  the  triangle  has  been 
generated,  the  front  is  updated  and  the  generation  proceeds  until  the 
front  is  empty.  Figure  3.1  illustrates  ihe  idea  of  the  advancing  front 
technique  for  a  circular  planar  domain  by  showing  the  initial  front 
and  the  form  of  the  grid  at  various  stages  during  the  generation 
process.  The  size  and  shape  of  the  generated  triangles  must  be 
consistent  with  the  local  desired  size  and  shape  ol  the  final  grid.  In 
the  three  dimensional  case,  these  triangles  will  become  faces  of  Ihe 
tetrahedra  to  be  generated  later. 


Figure  s.f 

The  ehuendng  Irws  technique  thowin*  dkerenl  etegee  Uurtng  the  trlen»vte1ion 
procou. 


For  the  generation  of  teirahedru  the  advancing  front  procedure  is 
taken  one  step  further.  The  front  is  now  mr.de  up  of  the  triangular 
faces  which  are  available  to  form  a  tetrahedron.  The  initial  from  it 
obtained  by  assembling  the  irlangulaliono  of  the  boundary  surfaces. 
Nodes  and  elements  will  be  simultaneously  created.  When  forming  o 
new  tetrahedron,  the  three  nodes  belonging  to  a  triangular  lace  from 
the  front  are  connected  either  to  an  existing  node  or  to  a  new  node. 
Attar  generating  a  tetrahedron,  the  front  is  updated.  The  generation 
procedure  is  completed  when  the  number  ol  triangles  in  the  (ront  is 
zero. 

3.2  Characterisation  of  the  Grid:  G'ld  Parameters 
The  geometrical  characteristics  of  a  general  grid  are  locally  defined 
in  terms  ol  certain  grid  parameters.  It  n  (»2  or  3),  is  the  number  of 
dimensions  thsn,  the  parameters  used  are  a  set  ol  N  mutually 
orthogonal  directions  a,:  i— i. ...  n.  and  n  associated  element  sizes 
jot,  ...  N  (see  figure  3.2).  Thus,  at  a  certain  pnint,  it  all  N  element 
sizes  are  equal,  the  grid  in  the  vicinity  ol  that  point  will  consist  ol 
approximately  equilateral  elements.  To  aid  the  grid  generation 
procedure,  a  transformation  T  which  is  s  function  ol  a  and  «j  is 
delined.  This  transformation  is  represented  by  a  symmetric  n«n 
matrix  and  maps  the  physical  space  onto  a  space  in  which  elements, 
in  the  neighbourhood  ol  the  point  being  considered,  will  be 
approximately  equilateral  with  unit  average  size.  This  new  space 
will  be  relered  to  as  the  normalised  space.  For  a  general  grid  this 
transformation  will  be  a  function  ot  position.  The  transtormetion  T 
is  the  result  ol  superimposing  N  scaling  operations  with  factors  1/6* 
in  each  gj  direction.  Thus 

N 

TtOi.s^X  (3.1) 

itt  6i 

where  ®  denotes  the  tensor  product  of  two  vectors.  The  effect  of 
this  transformation  in  two  dimensions  is  illustrated  in  figure  3.3  for 
the  case  of  constant  grid  parameters  throughout  the  domain. 


Flguro  3,2 

Char  odorts*  lion  o<  iha  grid,  (a)  tho  grid  par«mo*or*  In  two  dtmmsbrw.  (b)  tho  grid 
paramotor*  In  throe  dknonslons. 
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Figure  3  3 

Th*  «Mect  ol  tb®  trrtGtomialiofl  T  Iw  a  conslftnt  dlttifcutton  o4  th®  grtd  paramdert. 


3.3  Crld  Control:  Tha  Background  Qrid 

The  inclusion  of  adequate  grid  control  is  a  key  ingredient  in  ensuring 
the  generation  of  a  grid  ol  the  desired  form.  Control  over  the 
characteristics  is  obtainod  by  the  specification  of  a  spatial 
distribution  of  grid  parameters  by  means  of  a  background  grid.  The 
background  grid  is  used  for  interpolation  purposes  only  and  is  made 
up  of  triangles  in  two  dimonsions  and  tetrahedra  in  three  dimensions. 
Values  ofoi  and  6,  and  hence  T,  are  dalined  at  the  nodes  of  the 
background  grid.  At  any  point  within  an  element  ol  the  background 
grid,  the  transformation  T  is  coinputod  by  linearly  interpolating  its 
components  from  the  element  nodnl  values.  The  background  grid 
employed  must  cover  the  region  to  be  discretised  (see  tigure  3.4),  In 
the  generation  ol  an  initial  grid  lor  the  analysis  of  a  particular 
problem,  the  background  grid  will  usually  consist  ol  a  small  number 
of  elements.  The  generation  ot  the  background  grid  can  in  this  case 
be  accomplished  without  resorting  to  sophisticated  procedures  e  g.  a 
background  grid  consisting  ot  a  single  element  can  be  used  to  impose 
the  requirement  ot  linear  or  constant  spacing  and  stretching  through 
the  computational  domain.  The  generation  process  is  always  carried 
out  in  the  normalised  space.  The  transformation  T  is  repeatedly 
used  to  transform  regions  in  the  physical  space  into  regions  in  the 
normalised  space.  In  this  way  the  process  is  greatly  simplified,  as 
the  desired  size  tor  a  side,  triangle  or  tetrahedra  in  this  space  is 
always  unity.  After  the  element  has  been  generated,  the  coordinates 
of  tho  newly  created  polnl,  if  any,  are  transformed  back  to  the 
physical  space  using  the  inverse  transformation.  The  elfect  of 
prescribing  a  variable  grid  spacing  and  stretching  is  illustrated  in 
figure  3.5  for  a  rectangular  domnin  and  using  a  background  grid 
consisting  ot  two  triangular  elements. 


Figure  3.4 

The  background  grid  ler  the  ■pedlluttlon  ot  e  apetlel  dwribubn  ot  grid  peramewrt 


Figure  3.S 

Grid*  generated  lor  »  redengitler  domain  using  a  background  grid  consisting  ot  two 
elements  to  illustrate  the  eked  ol  variable  grid  spacing  and  stretching 


3.4  Curve  Olecretleation 

The  discretisation  of  the  boundary  curve  components  is  achieved  by 
positioning  nodes  along  the  curve  according  to  a  spacing  dictated  by 
the  local  value  of  the  grid  parameters.  Consecutive  points  are  joined 
by  straight  lines  to  form  sides.  In  order  fo  determine  the  position 
and  number  of  noaes  to  be  created  on  each  curve  component,  the 
following  steps  aro  followed: 

i)  Subdivide  recursively  aacti  cubic  segment  into  smaller  cubic 
segments  until  their  length  is  smaller  than  a  certain  prescribed 
value  A  safe  choice  for  this  value  is  the  minimum  spacing  specified 
in  the  background  grid  but  often,  considerably  larger  values  can  be 
taken.  The  length  of  each  cubic  segment  is  computed  numerically. 
When  subdividing  a  cubic  segment,  the  position  and  tangent  vectors 
corresponding  to  the  new  data  points  can  be  found  directly  from  the 
original  definition  of  the  segment. 

ii)  For  all  the  data  points  ij;  j-1 . n  (i.e.  those  used  to  define  the 

curve  and  those  created  to  satisfy  the  maximum  length  criterion), 
interpolate  from  the  background  grid  the  coefficients  o!  the 
transformation  T|  and  transform  the  position  and  tangent  vectors 

i.e.  ij  >  T|  if  and  £j  *  T^  The  new  position  and  tangent  vectors  4. 

tj;  J«1 . n,  define  a  spline  curve  which  can  be  interpreted  as  the 

image  of  the  original  curve  component  in  the  normalised  space.  It 
must  be  noted  that  because  of  the  approximate  nature  of  this 
procedure,  the  new  curve  will  in  general  have  discontinuities  ot 
curvature  even  though  the  curvature  of  the  original  curve  varies 
continuously. 

Hi)  Compute  (he  length  of  the  curve  in  the  normalised  space  and 
subdivide  it  into  segments  of  approximately  unit  length.  For  each 
nowly  created  point,  calculate  the  cubic  segment  in  which  It  is 
contained  and  Its  parametric  coordinate.  This  information  is  used  to 
determine  the  coordinates  of  the  new  nodes  in  the  physical  space, 
using  the  curve  component  definition. 
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3.S  Triangle  Generation  In  Two  Dimensional  Domain* 

The  triangle  generation  algorithm  utilises  the  concept  of  a  generation 
front  At  the  start  of  the  process  the  front  consists  of  the  sequence 
of  straight  line  peg  merits  which  connect  consecutive  boundary  nodes. 
During  the  generation  process,  any  straight  line  segment  which  is 
available  to  form  an  elament  side  Is  termed  active,  whereas  any 
segment  which  is  no  longer  active  Is  removed  trom  tho  front,  Thus 
while  the  domain  boundary  will  remain  unchanged,  the  generation 
front  changes  continuously  and  needs  to  bo  updated  whenever  a  new 
element  is  formed.  This  updating  process  Is  illustrated  in  figure  3.6. 
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Figure  3.6 

The  front  updating  procedure  In  two  dimension*,  (a)  Th*  tnitiai  Qtnafallon  t/ont.  (b) 
Creation  of  a  new  element  with  (1)  no  new  point  created  (2)  the  new  point  19  is 
created,  (c)  The  updating  of  the  Iron!  for  cams  (b)(2). 


in  the  process  ot  generating  a  new  triangie  the  following  steps  are 
involved  (figure  3.7): 


i)  Select  a  side  AB  of  the  front  to  be  used  as  l  base  tor  the  triangle 
to  be  geneiatea.  il»io,  the  criterion  is  to  choose  the  shortest  side. 
This  is  especially  advantageous  when  generating  irregular  grids. 

ii)  Interpolate  from  tha  background  grid  the  transformation  T  at  the 
centre  of  the  side  M.  and  apply  it  to  the  nodes  in  tho  front  which  are 
relevant  to  the  triangulation.  In  our  implementation  we  deline  the 
relevant  points  to  bo  all  those  which  lie  inside  the  circle  ol  eemre  M 
and  radius  three  times  the  length  of  the  side  being  considered.  Let  A, 
Band M  denote  the  positions  in  the  normalised  space  of  the  points  A. 
B  and  M  respectively. 

iii)  Determine,  in  the  normalised  space,  the  ideal  position  Ei  for  the 
vertex  ol  the  triangular  element.  The  point  £,  is  located  on  the  line 
perpendicular  to  the  side  tha!  passes  through  the  point  fci  and  ai  a 
distance  8,  from  the  points  A  and  fi.  The  direction  in  which  £(  is 
generated  is  determined  by  the  orientation  of  the  side.  Tha  valua  5, 
is  chosen  according  to: 


t  It  0.55-1  <  1  <  2-L 
0.55*1.  if  0.55-L  <  1 
2-L  it  1  >  2-L 


(32) 


where  L  is  the  distance  betwoen  points  A  and  B  Only  in  situations 
where  the  side  AB  happens  to  have  characteristics  very  different 
from  those  specified  by  the  background  grid  will  the  value  of  6\  be 
different  from  unity.  However,  tho  above  inequalities  must  be  taken 
into  account  to  ensure  geometrical  compatibility.  Expression  (3.2)  is 
purely  empirical  and  different  inequalities  could  be  devised  to  serve 
the  same  purpose. 


tv)  Select  other  possible  candidates  fc  •>  vertex  and  order  thorn  In 
a  list.  Two  types  ot  points  ere  considereo  viz.  (a)  all  the  nodes  Q.t, 
&  ...  in  the  current  generation  front  which  are,  in  the  normalised 
space,  interior  to  a  ci':le  with  centre  Et  and  radius  r  •  8,,  and  (b) 

the  set  of  points  Ei . £5  generated  along  the  height  EiM  For  each 

point  Qj,  construct  the  circle  with  centre  fij,  on  the  line  defined  by 
points  Ei  and  U  and  which  passes  through  the  points  &,  A  and  B  The 
position  ot  the  centres  Qi.  of  these  circles  on  the  line  EiM  defines  an 
ordering  ot  the  the  points.  A  list  Is  created  which  contains  all  the 
Q  points  with  tha  furthest  point  trom  £1  appearing  at  the  head  ot 
list.  The  points  Ei . E$are  added  al  the  end  ot  this  list. 

v)  Select  the  best  connecting  point.  This  Is  tho  firni  point  In  the 
ordered  list  which  gives  a  consistent  triangle.  Consistency  is 
guaranteed  by  ensuring  that  none  of  the  new’y  created  sides 
intersects  with  any  ot  the  existing  sides  in  lh»  front. 

vi)  Finally,  if  a  new  node  is  created,  its  coordinates  in  the  physical 
space  are  obtained  by  using  the  inverse  transformation  T  V 

vil)  Store  the  new  triangle  and  update  the  tront  by  adding/removing 
the  relevant  sides. 


Figure  3.7 

The  generation  ci  a  new  triangie. 


This  grid  generation  procedure  is  schematically  presented  in  the 
diagram  shown  in  ligure  3.8 

Grid  Quality  Enhancement 

In  order  to  enhance  the  quality  of  (he  generated  grid,  two  post¬ 
processing  procedures  are  applied.  These  procedures,  which  are 
local  In  nature,  do  not  alter  the  tola!  number  ot  points  or  elomonts 
in  the  grid. 

Diagonal  swapping  -  '.his  changes  the  connectivities  among  nodes  in 
the  grid  without  altering  their  position.  This  process  requires  a  loop 
over  all  the  elament  sides  excluding  those  sides  on  the  boundary.  For 
each  aids  AB  (ligure  3.9)  common  to  the  triangles  ABC  and  ADB  one 
considers  the  possibility  ot  swapping  AB  by  CD,  thus  replacing  the 
two  triangles  ABC  and  ADB  by  the  triangles  ADC  and  BCD.  The 
swapping  Is  performed  if  a  prescribed  regularity  criterion  is 
satisfied  better  by  the  new  configuration  than  by  the  existing  one.  In 
our  Implementation,  tha  awapping  operation  is  performed  if  the 
minimum  angle  occurlng  in  the  new  configuration  is  larger  than  in  the 
original  one. 
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3.6  Surface  Discretisation. 

The  method  followed  tor  the  triangulation  ot  the  surface  components 
is  an  extension  of  the  grid  generation  procedure  tor  planar  domains 
described  above.  The  discretisation  ol  each  surface  component  is 
accomplished  by  generating  a  two  dimensional  grid  of  triangles  in  the 
parametric  plane  (uVu2)  and  then  using  Jho  mapping  i(u\u2)  defined 
in  section  2  2  This  mapping  establishes  a  one  to  one  correspondence 
between  the  boundary  surface  component  and  a  region  on  the 
parametric  plane  (u’.u2)  (figure  3.10).  Thus,  a  consistent  triangular 
grid  in  the  parametric  plane  will  be  transformed,  by  the  mapping 
t(u\us),  into  a  valid  triangulation  ot  the  surface  component.  The 
construction  ot  the  triangular  grid  in  the  parameter  plane  (u’.u2) 
using  the  two  dimensional  grid  generator,  requires  the  determination 
ot  an  appropriate  spatial  distribution  ot  the  two  dimensional  grid 
parameters.  These  consist  ot  a  set  of  two  mutually  orthogonal 
directions  <u’;i»1,  2.  and  two  associated  element  sizes  V;l»1,  2. 


Figure  3.10 

The  mappmg  d  a  surface  component  onto  a  Iwo  dimensional  domain. 


Figure  3.i 

Odd  generation  using  the  advancing  Iron!  technique.  Double  lined  boxes  are  only 
required  d  Ihe  sheds  ol  variable  grid  sUe  and  snatching  ere  lo  be  included 


Grid  smoothing  -  this  alters  the  positions  ot  the  interior  nodes 
without  changing  the  topology  of  fhe  grid.  The  element  sides  are 
considered  as  springs  of  stiffness  proportional  lo  Ihe  !englh  ol  Ihe 
side.  The  nodes  are  moved  until  the  spring  system  is  in  equilibrium. 
The  equilibrium  positions  are  tound  by  iteration.  Each  iteration 
amounts  to  performing  a  loop  over  the  interior  points  and  moving 
their  coordinates  to  coincide  with  those  ot  the  centroid  ot  the 
neighbouring  points.  Usually  ihree  to  five  iterations  are  performed, 

The  combined  application  ol  these  Iwo  post  processing  algorithms  is 
found  to  be  very  effective  in  improving  the  smoothness  and 
regularity  ol  the  generated  grids. 


t>) 


(tri 


Figure  3  * 

The  diagonal  ewepplog  pvocedvre.  (a)  non  admi«*le  (b)  admixsiMe. 


The  two  dimensional  grid  parameters  in  the  (u’.u2)  plane  can  be 
evaluated  from  the  spatial  distribution  ol  Ihe  three  dimensional  grid 
parameters  and  the  distortion  and  stretching  introduced  by  the 
mapping,  To  illustrate  this  process,  consider  a  point  £*  in  the 
parametric  piano  ol  coordinates  (u1p,u2P)  where  the  values  ol  the 
grid  parameters  fy.Ri*;  i»1,2  are  to  be  computed.  Its  image  on  the 
surface  will  be  the  point  £  » i(u1i>,u2P).  The  transformation  between 
the  physical  space  and  the  normalise  1  space  at  this  point  TP  can  be 
obtained  by  direct  interpolation  Irom  me  background  grid.  A  new 
mapping,  valid  in  the  neighbourhood  of  point  P.  can  now  be  defined 
between  the  parametric  plane  (u\u4)  and  the  normalised  space  as 

Hiu’.u2)  ..Tpitu’.u2)  (3.3) 

A  curve  in  the  parametric  plane  passing  through  point  £’  and  with 
unit  tangent  vector  fl  *  (p’.p2)  at  Ihis  poinl,  is  transformed  by  the 
above  mapping  into  a  curve  in  the  normalised  space  passing  through 
the  point  TP  E  The  arc  length  parameters  ds  and  df,  along  the 
original  and  transformed  curves  respectively,  are  related  by  Ihe 
expression  [35] 


wo2={  i  aftSpipi}(‘'s)2  <3-4) 


Assuming  that  this  relation  ootween  the  are  length  parameters  also 
holds  tor  Ihe  spacings,  we  can  compute  the  spacing  Sj  along  tho 
direction  fl  in  the  parameter  plane  as 


/  v  aa.  aa  V'2 

H  1  ) 

l;j«1  > 


(35) 


The  two  dimensional  grid  parameters  af.SfiUl,  2  are  determined 
trom  the  directions  In  which  attains  an  extremum.  This  reduces  to 
lindlng  the  eigenvalues  and  eigenvectors  of  a  symmetric  2x2 

matrix. 
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To  form  the  initial  Iront,  the  (u',u2)  coordinates  ol  the  nodes 
already  generated  on  the  boundary  curve  components  have  to  be 
computed.  As  the  mapping  t( u 1 ,  u * )  cannot  be  inverted  analytically, 
the  coordinates  (u'.u*)  ol  such  points  are  found  numerically  by  using 
a  direct  iteration  procedure  [31). 

3.7  Generation  of  Tetrahedra 

The  starting  point  (or  the  discretisation  of  the  three  dimensional 
domain  into  tetrahedra  is  the  lormation  ol  an  initial  generation  (rent. 
The  initial  front  is  the  set  of  oriented  triangles  which  constitutes  the 
discretised  boundary  of  the  domain  and  Is  formed  by  assembling  Ihe 
discretised  boundary  surfaca  components.  The  order  in  which  the 
nodes  of  these  triangles  are  given  defines  the  orientation,  which  is 
the  same  as  that  of  the  corresponding  boundary  surface  component. 
The  algorithm  for  generating  tetrahedra  is  analogous  to  that 
described  above  for  the  generation  ol  triangles  (sea  figure  3.8). 
However,  in  the  three  dimensional  case  the  range  of  possible  options 
at  oach  stage  is  much  wider  and  the  number  of  geometrical 
operations  involved  increases  considerably.  Thus,  tha  ability  ol  the 
method  to  produce  a  grid  and  the  efficiency  of  its  implementation 
relies  heavily  upon  the  type  of  strategy  seloctsd.  The  generation  of  a 
generic  tetrahedral  element  involves  the  following  steps  (figure 
3.11). 

i)  Select  a  triangular  lace  ABC  from  the  Iront  to  be  a  base  for  the 
tetrahedron  to  be  generated.  In  principle,  any  lace  could  be  chosen, 
but  we  have  found  it  to  be  advantageous  in  practice  to  consider  the 
smallest  laces  first.  For  this  purpose,  the  size  ot  the  face  is  defined 
in  terms  of  the  size  of  its  shortest  height. 

ii)  Interpolate  from  the  background  grid  the  transformation  T  at  the 
centroid  of  the  face  M.  and  apply  it  to  tha  nodes  in  the  front  which 
are  relevant  to  tho  triangulation.  In  our  implementation,  We  define 
the  relevant  points  to  be  those  which  lie  inside  the  sphere  of  centre 
M.  with  radius  equal  to  three  times  the  value  ol  the  maximum 
dimension  of  tho  face  being  considered.  Let  A.  ft,  £.  and  M  denote  the 
positions  in  the  normalised  space  ol  the  paints  A.  ft.  Q.  and  hi 
respectively. 

iii)  Determine,  in  the  transformed  space,  the  ideal  position  £,  for  the 
vertex  of  the  tetrahedral  element.  The  point  Ei  lies  on  the  line  which 
passes  through  the  point  M  and  which  is  perpendicular  to  the  (ace. 
The  direction  in  which  E,  is  generated  is  determined  by  the 
orientation  of  Ihe  lace.  The  location  of  Ei  is  computed  so  that  the 
average  length  of  the  three  newly  created  sides  which  join  point  P. 
with  points  A,  ft.  and  ft.  is  unity.  For  tacos  whose  size  in  Ihe 
parametric  plane  is  very  different  from  unity,  this  step  may  have  to 
be  modified,  as  in  expression  (3  2),  to  ensure  geometrical 
compatibility.  However,  such  cases  rarely  occur  in  practice.  Let 

be  the  maximum  ol  the  distances  between  point  £i  and  points  A,  EL 
and  ft. 

iv)  Select  other  possible  candidetae  tor  tha  vertex  and  order  them  in 
a  list.  Two  types  ot  points  are  considered  viz.  (a)  all  the  nodes  Qt,, 

...  in  the  current  generation  Iront  which  are,  in  tha  normalised 
space,  interior  to  a  sphere  with  centre  M  and  radius  f ■  6|,  and  (b)  a 

new  set  ot  points  Ei . £s  generated  along  tha  height  EiM-  Consider 

the  set  ol  points  A,  B.  and  £  and  denote  by  ft  the  member  of  this  set 
which  is  furthest  away  Irom  M.  For  each  point  Qj,  construct  tha 
sphere  with  centre  ft.  on  the  line  defined  by  points  Ei  and  (A  and 
which  passes  through  point  Q  and  ft.  The  position  of  tha  centres  fii  of 
these  spheres  on  the  line  EiM  delines  an  ordering  of  the  the  Qi  points 
with  the  turthest  point  from  Ei  appearing  at  the  head  of  list.  The 
points  Ei . Es  are  added  at  the  end  of  this  list. 

v)  Select  the  best  connecting  point.  This  is  the  first  point  in  the 
ordered  list  which  gives  a  consistent  tetrahedron.  Consistency  is 
guaranteed  by  ensuring  that  none  ol  ihe  newly  created  sides 
intersects  with  any  of  tha  existing  faces  in  the  front,  and  that  none 
of  the  existing  sides  in  the  Iront  intersect  with  any  ot  tha  newly 
created  laces. 

vi)  II  a  new  node  is  created,  its  coordinates  in  the  physical  apace 
are  obtained  by  using  the  inverse  transformation  T1. 

vii)  Store  the  new  triangle  and  update  tha  front  by  adding/removing 
the  necessary  sides. 


X  HELP  POINTS 
O  POINTS  IN  THE  FRONT 


njuu  an 

Tha  generation  d  a  tetrahedral  element 


3.8  Grid  Quality  Assessment 

Any  discussion  ot  grid  quality  should  be  intimately  related  to  the 
form  of  the  solution  we  are  trying  to  represent  on  that  grid.  Two 
factors  need  to  be  considered  here: 

i)  Determination  ol  the  characteristics  ot  the  optimal  grid  tor  the 
problem  at  hand.  This  introduces  ths  concept  of  adaptivity  and  this 
aspect  is  considered  in  section  5. 

ii)  Assessment  on  how  woll  the  generated  grid  meets  the 
requirements  specified  by  the  grid  parameters.  This  assessment  can 
be  made  by  examining  the  generated  grid  and  determining  tho 
statistical  distribution  ot  certain  indicators.  For  example  in  figure 
3.12  we  havo  chcson  as  indicators  iiie  number  of  elements  around  a 
side,  the  magnitude  ol  Ihe  element  dihedral  angles  and  the  length  ol 
Ihe  side.  These  indicators  are  compared  with  optimal  values  i.e. 
those  o!  a  regular  tetrahedron  which  has  the  exact  dimensions 
specified  hy  Ihe  grid  parameters. 
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Wgure  3.12 
Grid  Quality  ttarift tics. 
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3.9  ©rid  Generation  lor  •  Generic  Fighter  Configuration 
In  computational  aerodynamics,  a  problem  ol  current  Interest  is  the 
prediction  ol  the  inviscid  llovriield  about  complete  aircraft 
configurations.  The  problem  considered  here  Is  the  simulation  ol  the 
How  past  a  generic  lighter  with  canard,  70  20  cranked  delta  wing, 
vertical  lin  and  engine  Inlet.  This  same  configuration  has  been 
studied  previously  using  an  algebraic  grid  generation  approach  [36|, 
Due  to  the  symmetry  ol  the  problem  only  hall  ol  the  lighter  is 
modelled.  Figure  3.13(a)  shows  the  geometry  definition  ol  the 
computational  domain.  The  background  grid  employed  is  illustrated  In 
(igure  3.13(b).  The  curve  components,  defined  in  terms  ol  cubic 
splines  and  the  discretisation  ol  these  components  is  displayed  in 
figure  3.13(c).  The  Individual  surface  components  are  described  by 
tensor  product  surfaces  and  the  surface  discretisation  is  illustrated 
in  figure  3.13(d).  An  intermediate  stags  during  the  tetrahedra 
generation  process  is  displeyed  in  figure  3.13(e).  The  linal  grid 
:onsi$ted  ol  76,522  tetrahedra  and  included  a  full  simulation  ol  the 


figure  111 

Qrld  generefen  tar  •  oonplew  aircraft  conltguafton  (a)  tha  ccmfwWtorwl  domain 
and  tha  geometry  eettnfton.  (6)  tha  hactigraund  grid  (c)  tha  ragtmnmion  at  lha 
tuna  osmeenantu  and  Ox  gtmaraicd  point*,  (d)  lha  twite*  dho*tl*aUon.  (a)  A 
partt*!  vtaw  ct  lha  tatrahadtal  grid. 


3.10  The  Computation  of  Treneonlc  Flow  Peet  an 
Inetalled  Nacelle 

To  illustrate  tha  numerical  performance  ol  the  unstructured  mesh 
solution  procedure,  we  consider  the  analysis  ol  a  transonic  inviscid 
How  past  a  realistic  wlngfpylonfnecelle  configuration  (69).  The 
configuration  has  been  tested  In  a  wind  tunnel  with  e  turbine  powered 
simulator  to  represent  the  How  into  the  engine  and  the  jot  emerging 
Horn  the  exhaust.  The  assumed  symmetry  ol  the  problem  means  that 
only  one  hail  ol  the  configuration  needs  to  be  considered.  The 
definition  of  the  computational  domain  is  completed  by  the  addition  ol 
the  symmetry  plane  and  planar  far  Held  boundaries,  as  shown  In 
figure  3.14a.  Following  the  triartgulatlon  ol  tha  computational 
boundaries,  the  aircraft  surface  Is  represented  by  An  assembly  of 
36,330  triangles  with  19,167  nodal  points.  Two  different  views  ol 
the  surface  triengulation  are  given  in  figure  3.14b.  Starling  from 
this  collection  of  triangular  faces,  tha  computational  domain  is  filled 
with  tetrahedra  by  the  advancing  front  method,  simultaneously 
creating  interior  nodal  poinls  and  elements.  Tha  3D  grid  which  is 
generated  contains  592,380  tetrahedra  and  112,198  nodes.  For  the 
simulation,  a  tree  stream  Mach  number  ol  0.801  and  an  angle  of 
attack  of  2.738  degrees  are  assumed.  The  How  within  the  engine  is 
simulated  by  computing  a  pressure  ratio,  engine  mass  How  rate  and 
a  jet  total  temperature  by  assuming  that  the  separate  core  and  tan 
etraama  In  the  experiment  mix  together  completely  within  the 
turbine  powered  simulator,  before  exl.eueting  through  the  nozzle. 
Starting  from  free  stream  conditions,  the  solution  Is  advanced  for 
2,500  time-steps,  during  which  time  the  residual  is  reduced  by  tour 
orders  ol  magnitude.  The  computed  distribution  ot  the  pressure 
contours  on  the  surface  of  the  model  is  shown  in  figure  3.14c.  In  the 
experimental  configuration,  pressure  guages  are  distributed  at  fixed 
sections  over  the  surface  of  the  wing  and  also  on  the  surface  ot  the 
nacelle.  A  comparison  of  the  computed  and  the  experimentally 
observed  values  of  the  pressure  coefficient  at  a  section  along  the 
wing  and  on  the  surface  ol  the  nacolle  is  given  in  ligure  3.1 4d.  The 
agreement  between  the  numerically  predicted  values  and  the 
experimentally  determined  pressure  data  is  very  good,  apart  from 
the  over  prediction  ot  pressures  on  the  upper  surface  ol  the  wing. 
The  discrepancies  between  the  experimentally  observed  pressure 
distributions  and  the  predictions  ol  the  Euler  How  solver  on  the  wing 
upper  surface  exist  in  regions  where  signiticant  viscous  eflocls  can 
be  expected  to  occur. 


(a) 


5-:i 


!’ 

f 

* 

> 

\ 


l 

\ 


i 

i 


i 

t 

f 


s 


3.11  Th*  Ue*  of  Multi-Grid  Accoloratlon 
The  sequence  of  unstructured  grids  required  for  a  multi-grid 
Algorithm  of  the  type  described  above  is  readily  produced  by  th*  grid 
generator  by  simply  altering  the  scaling  of  the  user  specified 
distribution  functions. 

Transonic  Flow  In  a  Channel 

The  multi-grid  scheme  Is  applied  first  to  the  solution  of  transonic 
How  In  a  channel.  This  is  a  3D  simulation  of  a  2D  flow  past  a  4% 
thick  circular  bump.  Th*  frea  stream  conditions  correspond  to  a 
Mach  number  ol  0.675.  The  compulation  is  parlormed  by  using  a  fine 
grid  of  28,822  tetrahedra  and  throe  coarser  grids  of  4.137,  556 
and  119  tetrahedra  respectively.  The  corresponding  trlanguiaticnc 
of  the  boundary  ol  the  computational  domain  ere  shown  In  figures 
3.15a  to  3.i5d.  Figure  3.15*  displays  the  pressure  solution  alter 
ISO  multi-grid  V-cycles  (/cycle  at)  with  nl  »1,  n2  •  1  and  r>3  *0. 
The  Increase  in  the  rate  ot  convergence  towards  the  steady  state  by 
the  use  ot  tho  multi-grid  scheme  is  readily  observed  in  figure  ?.15( 
which  compares  the  convergence  history  ol  trie  How  calculation  on 
the  linest  grid  with  tho  one  obtained  using  the  multi-grid  scheme 
with  lour  grids. 
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Trenwnte  How  Ir.  «  chennol  nl  *  tree  itiMm  Much  nwnbor  ot  0  STS  (*)  Flat  jiM  ot  1  tS 
elomenfo,  S3  po*Ho  <b  SoconC  grid  ol  SSS  elomontt,  21«  poW>  (e)  TWrd  gild  ot 
4.137  etomentt.  1.142  poti.u.  (d)  Fount  grid  ol  it.tzt  dtermut  .  S.709  point  (•) 
Computod  pn»uro  contm.n.  (I)  Compwtoon  ol  the  convergeoce  hluonot  produced 
by  uotng  rho  Hot  g.W  atone  end  by  uiirtg  muM-grtd. 


Flow  About  a  Twin  Engined  Alrerati 

As  an  illustration  ol  the  performance  ot  tho  multi-grid  acceleration 
procedure  lor  a  realistic  aeronautical  configuration,  a  transonic  tlow 
about  a  twin  engined  Dassault  Falcon  has  been  computed.  For  the 
simulation,  a  tree  stream  Mach  number  ot  0.65  Is  assumed  and  the 
angle  oi  attack  is  taken  to  be  two  degrees.  Flow  through  conditions 
are  imposed  at  the  engines.  Due  to  the  assumed  symmetry  ot  the 
How,  tho  numerical  compulations  were  performed  using  only  one 
hall  of  Ihe  aircraft  geometry.  The  representation  ol  the  boundary  ot 
the  computational  domain  consists  ot  22  surface  components  and  47 
curve  components.  Tho  surface  components  used  to  represent  the 
aircraft  geometry  are  shown  In  figure  3.16a.  Using  this  geometrical 
definition,  three  grids  ere  gonerated,  consisting  ot  863.967. 
212,433  and  100,215  elements  respectively.  The  surface 
triangitlotions  for  these  grids  are  shewn  in  figures  3.16b  to  3.1 6d  . 
A  three  stage  time-slepping  scheme  and  a  double  W  cycle  {icyclo  =2) 
Is  employed,  with  one  pre-  and  one  post-smoothing  slep  used  on  each 
grid(  nl  at,  «:  si  and  n.t  =1).  A  plot  showing  the  ol  Ihe  value  o*  the 
logarithm  of  the  density  residual  versus  the  numbor  of  fine  grid 
residual  evaluations  lor  both  the  line  grid  only  calculation  anc'.  the 
multi-grid  cycle  is  shown  in  figure  3.161.  The  computed  steady  state 
distribution  ot  pressure  contours  on  the  aircraft  surface  in  shown  In 
figure  3.16*.  This  solution  was  obtained  in  about  190  minutes  ol  epu 
time  on  a  modern  supercomputer  using  a  single  processor.  For  this 
example.  Ihe  multi-Qrid  version  ot  the  solver  requires  89  storage 
locations  per  node.  Thos  compares  very  favourably  with  ihe  63 
locations  per  node  required  by  the  flow  solver  on  a  single  grid. 


Figure  3.11 

Flow  pM!  ft  DmimA  Fnicoo  M  »  lr<*  frir#*ni  M»ch  numb*  of  0  85  And  «J  two 
nngKf  oi  Mlac*  (*)  Th*  g«oni»iry  drtlnRion.  (b)  Th*  1l»t  grid  or  100,21$  •Aments, 
19,046  polnl#  (c)  Th*  second  grid  gt  212.43?  •temenl*  «nd  40,322  point*  (d)  Th* 
third  grid  oi  663,967  utemonts,  161,606  points,  (a)  Comparison  ol  (ho  convergatic* 
histories  produced  by  ustrvg  the  line  grid  Aloo*  an/  by  using  multi-grid. 


4  DATA  STRUCTURES 

(This  section  has  been  written  In  collaboration  with  J.  Donet, 
Institute  for  Numerical  Methods  in  Engineering,  University  College. 
Swansea  SA2  6PP.  UK] 

Prom  Iho  provious  section  it  is  apparent  that  a  successful 
implementation  ol  the  presented  algorithm  will  roquiro  tho  use  of 
data  structures  which  onablo  coriain  sorting  and  soarching 
operations  to  be  performed  efficiently.  For  Instance,  the  generation 
front  will  require  a  data  structure  which  allows  for  tho  elficiont 
insnrlinn/delelion  of  sirtes/lncns  and  which  also  allows  lor  the 
efficient  identification  ol  (lie  sides/fncos  which  intersect  with  a 
proscribed  region  In  space. 

The  problem  of  determining  the  mombers  of  a  set  of  n  points  which 
lie  inside  a  prescribed  subregion  of  an  n  dimensional  spnce  is  known 
as  geometric  searching.  Several  algorithms  have  been  proposed 
[37-40]  which  solve  the  above  or  equivalent  problems  with  a 
computational  expense  proportional  to  log(n),  The  problem 
complexity  increases  considerably  when,  instead  ol  considering 
points,  one  deals  with  finite  size  objects  such  lino  segments, 
triangles  or  tetrahedra.  A  common  problem  encountered  here, 
namely  geometric  intersection,  consists  ol  finding  the  objects  which 
overlap  a  cerlain  subregion  ol  the  space  boing  considered. 
Algorithms  for  solving  this  problem  in  two  dimensions  exist  [41]  and 
have  been  applied  in  determining  the  intersection  between 
geometrical  objects  in  the  plane.  To  our  knowledge,  the  only 
algorithm  capable  of  solving  this  problem  in  three  dimensions  is 
based  on  the  use  of  the  alternate  digital  tree  [42).  The  particular 
application  which  motivated  the  development  of  tills  data  structure 
was  the  implementation  of  the  grid  generation  algorithm  described  in 
the  previous  section. 

In  what  follows  we  shall  describe  an  algorithm  and  associated  data 
structure,  called  the  alternating  digital  tree  (ADT),  which  allows  for 
the  efficient  solution  of  the  geometric  searching  problem.  It 
naturally  offers  the  possibility  of  inserting  and  removing  points  and 
optimally  searcnlng  for  the  points  contained  Inside  a  given  region.  It 
is  applicable  to  any  number  of  dimensions,  and  Is  a  natural  oxtonslon 
ol  the  so  called  digital  tree  search  technique  which  Is  exhaustively  in 
[43]  lor  one  dimensional  problems.  A  procedure  which  allows 
treatment  of  any  geometrical  ooject  in  an  N  dimensional  space  as  a 
point  in  a  2H  dimensional  space  will  be  Introduced;  thereby  allowing 
the  proposed  technique  to  be  employed  for  the  solution  ol  geometric 
intersection  problems. 


4.1  Binary  free  Structure* 

Binary  trass  provide  the  basis  lor  severe!  searching  algorithms, 
including  the  on*  lo  he  presented  here.  It  Is  therefore  necessary  to 
Introduce  some  basic  concepts  and  terminology  related  lo  binary 
tree  structures.  More  detailed  expositions  can  be  lend  m  (41,44). 

Definition  end  Terminology 

Troe  structures  provide  a  systematic  way  ol  storing  a  collection  ot 
data  Items  which  enables  not  only  a  quick  access  lo  the  information 
stored,  but  also  Irequont  insertions  and  deletions  ol  item*.  Thie 
degree  ot  tlexiblllty  requires  the  storage  ot  data  Items  In  non* 
sequential  locations  ot  the  computer  memory.  As  figure  4. in 
illustrates,  to  achieve  this,  each  data  Item  is  extended  by  the 
addition  ol  two  Integer  values,  known  as  the  Ml  and  right  links,  and 
Stored  in  what  is  known  as  a  node  o!  the  tree.  Each  added  link  can 
oither  bs  equal  to  zero  or  equni  to  the  position  in  memory  whoro 
anolher  node  ot  the  tree  can  be  lound.  Hence,  from  one  node  of  the 
treo  it  is  possible  to  reach  at  most  two  other  nodes.  Moreovor,  in 
ordor  to  onsuro  that  evory  node  can  bo  reached,  these  links  must  be 
sucli  ttiat  lor  oach  node  except  one,  known  as  the  root,  there  is  one 
and  only  one  link  pointing  ot  It.  This  definition  establishes  a 
hierarchy  ot  nodes:  the  toot  at  the  lop  level  ol  the  hierarchy  points 
at  o,  1  or  2  nodes  ot  the  next  level;  each  ol  these  in  turn  points  ot 
oilier  o,  1  or  2  nodos  at  tho  next  level  of  Ihe  hierarchy;  and  so 
forth.  This  hierarchical  structure  inspires  the  graphical 
representation  shown  in.  figure  4.1b  lor  a  simple  tree  comprising 
only  eight  nodes  (A,  B,  C,  0.  E,  F,  0.  H). 

Gonoaloglcol  terms  are  normally  used  to  doscribo  tho  lolativo 
position  of  nodes  in  a  troe:  when  a  node  points  at  a  second  nodo,  the 
former  is  called  tho  father  ot  the  latter,  and  this  the  son  ol  the 
former  nodo.  A  node  without  sons,  that  is.  with  both  links  blank,  is 
caltod  a  terminal  node,  and  tho  only  nodo  without  a  (other  Is  the  root 
(node  A  in  hguro  4.1b).  Givon  a  node.  Ihe  set  ol  nodos  formed  by 
itsotl  togothor  with  ail  its  doscondants  constitutes  a  subtree  ot  tho 
main  troo.  For  instance.  In  figure  4.1b  the  trees  |C,  D,  E,  F,  G,  H) 
and  (F„  0,  It)  are  subtroos  ol  the  main  tree  rooted  al  C  and  E 
respectively. 


iOOR(55 


ui 


lb] 


Fit'll*  4.1 

A  4'inp4*  binary  and  H*  iio<ag*  w  compvrt*  ro*in«y 


Tree  Trevereal 

To  retrieve  information  stored  in  a  given  node  requires  knowledge  ot 
its  location  in  memory,  which  is  kept  by  its  father.  Hence,  a  node  In 
the  tree  can  only  be  examined  or  visited  il  at)  Its  ancestors  are 
visited  first.  However,  il  is  possible  to  systematically  examine  each 
node  In  such  a  way  that  every  node  is  visited  exactly  one*.  Such  an 
oneration  is  known  as  traversing  the  tree  and  provides  the  basis  for 
the  searching  methods  discussed  below.  Although  sevoral  algorithms 
can  be  tound  in  the  literature  to  traverse  a  '^nary  tree  [44), 
attention  whl  be  centred  here  on  the  so-called  preorder  traversal 
method.  This  technique  is  embodied  In  the  following  three  eleps; 


t.  Visit  the  root  of  Ihe  current t  subnet 

2.  II  Ihe  tell  link  of  the  root  is  not  zero  then  Inverse  the  left 
subtree. 

3.  II  the  right  link  ol  Ihe  root  is  not  zero  then  Inverse  Ihe  right 
subtree. 

The  procedure  determined  by  these  three  steps  Is  cleatly  recursive, 
that  Is.  steps  2  and  3  Invoke  again  the  algorithm  which  they  deline. 
In  order  to  llluslrat-  this  process,  consider  again  tho  troo  shown  In 
figure  4.1b;  lor  this  tree,  the  repeated  application  ol  the  above 
algorithm  yields  the  following  sequence: 

1.  Traverse  the  tree  (A,  B,  C,  0,  E,  F,  G,  H‘ 

1.1.  Visa  A 


1.2.  Traverse  Ihe  fret*  |0) 

1.2.1.  mil a 

1.2.2.  Skip 
1.2.3  Skip 

1.3.  Traverse  the  tree  (C,  D,  E,  F,  G,  II) 


1.3.1. 

Visit  C 

1.3.2. 

Traverse 

the  tree 

ID.  F) 

1.3.21. 

Visit  D 

t  .3.2.2. 

Traverse 

Ihe  tree 

1.3. 2.2.1 

Visit  F 

1.3.2.2.2 

Skip 

1.3. 2.2.3 

Skip 

1, 3.2.3 

Skip 

1.3.3. 

Travorse 

ihe  tree 

IE.  G,  H) 

1 ,3.3,1. 

MILL 

1. 3.3.2. 

Traverse 

Ihe  tree 

1. 3.3.2. 1 

XhiLG 

1,33.2.2 

Skip 

t. 3.3.23 

Skip 

1 .3.3.3. 

Traverse 

(he  free 

1. 3.3.3. 1 

Visit  H 

1. 3.3.3. 2 

Skip 

13.3.3.3 

Skip 

Thus,  tho  nodes  ot  the  Iroo  in  ligure  4.1b  In  preoidsr  are  A.  3,  C,  0, 
F.  E,  G  and  H. 

We  notice  in  the  above  algorithm  that,  betore  moving  on  to  traverse 
the  left  subtree  -  step  2  In  the  previous  algorithm  -  II  Is  necessary 
to  store  the  value  of  Ihe  right  link,  that  Is,  the  addtoss  ol  Ihe  right 
son,  in  order  to  unable  the  subsequent  travo'sal  ol  the  right  subtree. 
Moreover,  whilst  traversing  the  loft  subtree  it  Is  likely  that 
additional  right  links  will  have  to  be  stored,  in  tact,  a  list  containing 
the  addrosses  ot  all  right  suhtrees  encountered  along  ilia  way  which 
me  yet  to  be  traversed,  must  be  kept  end  has  to  be  continuoualy 
updated  as  follows.  After  visiting  each  node,  the  right  link,  II 
different  from  zero,  Is  added  to  the  hst  end  if  the  left  link  is  not 
zero  the  tell  subtree  is  traversed.  When  a  zero  left  link  is 
encountered,  the  last  right  link  inserted  In  the  list  is  retrieved,  as 
well  as  removed,  from  the  list  and  the  subtree  rooted  at  this 
address  Is  traversed. 
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Figure  4.2 
The  irtWKon  proc*u. 


,W*I 


This  type  ol  list.  Ill  which  Items  ate  inserted  on*  by  one  and 
expected,  also  one  at  a  time,  In  the  reverse  order,  is  Known  ns  a 
stack  [44).  A  stack  consists  ot  a  linear  airay,  or  vector,  together 
with  an  Integer  variable  to  record  the  number  ot  Items  In  the  army. 
This  variable,  being  initially  zero,  Is  increased  by  one  every  time  en 
Item  Is  added  to  the  stack  and  decreased  by  one  when  an  hern  is 
extracted  trom  it. 

With  the  half)  ot  a  stack,  any  recursive  algorithm  can  be  implemented 
without  the  need  to  use  recursive  routines.  For  Instance,  a  non* 
recursive  implementation  ot  the  traversal  algorithm  given  above  can 
be  symbolically  expressed  as: 
o,«  Sol  i  ooi  odui  uon  •  address  ot  tho  root  node 

O.t)  Sot  fit  nck_n  lr.c  ■  0 

1 .  Visit  tho  niKhi  stored  ot  s  oot  avMroof? 

2.  If  *  0  thtfi): 

Sot  rttn“k_  .  *.  t| 

*  riyht_Hnk 

endif 

3a.  JM*ri_ link  *  o  th&n: 

$6t  toot  AiMuti6  w  1  oft _ link 

yo  to  1 

endif 

3b.  If  lod. link  -  0 

If  «»t  ,-wk  _t»  l  so  *  o  thon; 

Sot  voM •  i\t  Ack  tntAek.  si  rol 
Sot  3t«ck..»i;o  r  st>ick.(iiio  -  i 
go  to  t 
on  dll 

end/t 

tin t.ick_r.iro  *  o  -►  terminate  the  process 


Inserting  and  Oeletlng  Nodes 

In  order  to  add  o  now  data  Item  to  a  binary  treo,  a  node  containing 
tho  now  hem  ot  Information  must  bo  created  and  stored  In  a 
convenient  memory  location.  Tho  lett  and  right  links  ot  this  node  are 
set  to  ze.o.  It  the  current  tree  is  empty,  the  new  node  becomes  tho 
root  ol  the  tree,  otherwise  the  node  must  be  Inserted  or  linked  to  tho 
existing  treo.  To  achievo  this,  the  tree  Is  lollowed  downwards, 
starting  trom  the  root  and  lumping  trom  lather  to  son,  until  a  blank 
link  Is  found.  This  link  is  then  set  to  iho  memory  position  ot  the  now 
node.  When  moving  down  tho  tree,  a  criterion  must  bo  provided  at 
each  node  to  chose  between  the  tell  or  right  branches.  Tills  ciiteilon 
determines  tho  tinal  position  in  (he  tree  ol  the  new  node  and. 
consequently,  the  shape  ol  the  tree  itselt. 


Oeletlng  a  node  from  a  binary  tree  is  a  sliaightloiward  operation  il 
the  undesired  nodo  is  a  terminal  node;  changing  'o  zero  tho 
coiresponding  link  ot  its  lather  elleclivaly  'prunes'  the  node  trom 
the  tree  and  renders  the  memory  occupied  by  it  available  tor  lulure 
uses.  In  the  case  ot  an  intermediate  node,  the  process  becomes 
slightly  more  complicated  since  a  gap  can  not  be  left  In  tho  troe.  To 
overcome  this  problem,  the  unwanted  node  Is  replaced  by  a  terminal 
nodo  choson  trom  among  its  descendants.  Tms  operation  can  be 
carriod  out  by  modifying  the  links  to  sun  the  new  structure  ot  the 
tree  and  without  moving  the  nodes  trom  thoir  memory  positions. 
Figuros  4.2a  and  4.2l>  illustrate  the  dototion  ol  nodo  C  troin  tho  troe 
shown  In  liguies  4.1a  and  4.1b  and  its  replacement  by  node  H 

II  tho  application  at  hand  demands  Irequent  dotation  and  insertion,  a 
memory  book-keeping  system  is  necessary  tor  the  etticiont 
implementation  ot  tree  structures.  This  is  required  so  that  now 
nodes  can  bo  placed  in  (he  memory  space  released  by  iho  dotation  ot 
previous  nodes  This  problem  can  be  solved  by  using  a  linked  list 
structure  to  record  nil  the  available  memory  spaces.  A  linked  list  Is 
a  data  structure  mat  differs  from  the  binary  ties  data  structure 
described  above  In  that  every  nodo  has  always  only  one  tmk  pointing 
at  another  node,  and  oveiy  node  has  always  one  link  pointing  at  II. 
There  are  two  exceptions,  which  are  the  head  and  the  end  nodes 
The  head  Is  a  nodo  with  no  link  pointing  at  it  •  tho  address  ot  which  Is 
kept  separately  ■  and  the  end  Is  a  node  with  a  blank  link. 

As  shown  In  tigure  4.3  the  two  dam  structures,  binary  treo  and 
linked  list,  are  updated  simultaneously,  initially,  the  available 
memory  is  partitioned  into  cells  ot  Iho  correct  size  to  store  tree 
nodes  These  cells,  which  contain  no  relevant  information  other  than 
•  single  link,  are  than  joined  together  to  form  a  linked  list.  Every 
time  a  node  needs  to  be  inserted  into  the  tree,  the  memory  space 


required  by  this  now  tree  node  Is  generated  by  removing  a  node  horn 
the  list  (see  tigure  -1.3b).  Similarly,  when  a  node  Is  deleted  horn  the 
tree  it  Is  added  to  the  list  (soo  tigure  4.3c).  Insorting  and  deleting 
nodes  in  the  list  always  takes  place  at  the  hsad.  To  Insert  a  node  into 
the  list,  the  link  ol  the  new  node  Is  set  equal  to  tliu  address  ot  the 
head  and  the  Insetted  node  becomar  the  new  head  ot  the  list.  The 
deletion  ot  the  head  node  can  be  done  by  simply  allowing  lis  link  to  bo 
the  now  head. 


MKWtSS 


fc-'-M&ifciL 


(«> 


A00RCS5 


0 - ©  *©  *© - © 


snonr ss 


N*  • 

N,  - 

UnlS 

Nt - - 

H. 

Dillil 

\ - - 

N„  - 

bssbss 

N% - 

-»«v'  t'-iC’le, 

P1B1W 

*1.  A  |P  ,  4  .X-  >  Mlk 

N, - e- 

Bum 

n4 - 

-iihO  w,  Om. 

moot  i 


IliUUI 


Flpui*  4.3 

6ln*ry  *%\  cool^urmkm*.  (a)  Tto  u<w  ol  (tport  4  t  (b)  An*  tba 

'maitloo  ol  no<##  I  u*-‘ng  tHa  tioiag#  fyy  nod*  a  (*)  Att*r  noJ« 


ck,'  x  ck',  dj,,1  a  dk'  for  i  *  j  and 


4.2  Tha  Alternating  Digital  Traa 

Consider  a  set  of  n  points  in  a  N  dimensional  space  (RN  )  and  a3sume 
(or  simplicity  that  the  coordinate  values  of  their  position  vectors 
(&i.  ...  £n).  alter  adequate  scaling,  vary  within  the  interval 

[0,1).  The  aim  ol  geometric  searching  algorithms  is  to  select  Irom 
this  set  those  points  that  lie  inside  a  given  subregion  of  the  spaco.  To 
facilitate  their  representation,  only  rectangular  -  or  'hyper- 
rectangular'  -  regions  will  be  considered,  thereby  allowing  their 
definition  in  terms  ol  the  scaled  coordinates  of  the  lower  arid  upper 
vertices  as  (A.  ft). 

Comparing  the  coordinates  of  each  point  k  with  the  vertex 
coordinates  ol  a  given  subregion  to  check  whether  the  condition  aj  £ 
xhj  sb,  is  satisfied  lor  i  =  t.  2  ...  n,  would  render  the  cost  of  the 
searching  operation  proportional  to  the  number  of  points  n.  This 
computational  expense,  however,  can  be  substantially  reduced  by 
storing  the  points  in  a  binary  tree,  in  such  a  way  that  the  structure 
ot  the  tree  reflects  the  positions  ot  the  points  in  space.  There  exist 
several  well  known  algorithms  that  will  accomplish  this  effect  for 
one  dimensional  problems;  the  most  popular  are  the  binary  search 
tree  and  digital  tree  methods  [41,43].  Binary  search  trees  have 
been  extended  to  N  dimensional  problems  In  [43],  but  the  resulting 
tree  structure,  known  au  N-d  trees,  do  not  allow  the  efficient 
deletion  ot  nodes.  The  algorithm  presented  here  is  a  natural 
extension  ol  the  one  dimensional  digital  tree  algorithm  and 
overcomes  the  difficulties  encountered  in  N-d  troes. 

Definition  and  Node  Insertion 

Broadly  sp  jakmg,  an  alternating  digital  troo  can  be  defined  as  a 
binary  tree  in  which  a  set  of  n  points  are  stored  following  certain 
geometrical  criteria.  These  criteria  ore  based  on  the  similarities 
arising  between  the  hierarchical  and  parental  structure  ol  a  binary 
tree  and  a  recursive  bisection  process:  each  node  in  the  tree  has  two 
eons,  likewise  a  bisection  process  divides  a  given  region  into  two 
smailor  subregions.  Consequently,  it  is  possible  to  establish  an 
associalion  botwoen  tree  nodes  and  subregions  ot  the  unit  hypercube 
as  follows;  the  root  represents  the  unit  hypercube  itself;  this  region 
is  now  bisected  across  the  x1  axis  and  the  region  for  which  0  £  x1  < 
0.5  is  assigned  to  the  left  sen  and  the  region  lor  which  0.5  <  x1  <  1 
is  assigned  to  the  right  son,  ai  each  of  these  nodes  ihe  process  is 
repeated  across  the  x2  direction  as  shown  in  tigure  4.4.  In  a  two 
dimensional  spoe-.i  procoss  can  bo  repeated  indefinitely  by 
cnosing  x’  and  x*:  1  actions  in  alternating  order:  similarly,  in  a 
genor.il  N  dimensional  space,  the  process  can  be  continued  by 
choosing  directions  x1,  x2.  ...  xN  in  cyclic  order. 
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Figure  4.4 

The  ret *i ton  between  e  biniiy  tree  anS  e  bisection  process 


Generally,  it  a  node  k  at  the  hierarchy  level  m  -  the  root  being  level 
0  represents  a  region  (Ck.  tlx),  the  r  jregions  associated  to  its  left 
and  right  sons.  (gw.  du)  did  (&*,.  &>,,)  result  from  the  bisection  ol 
(fit,,  da)  by  a  plane  normal  to  the  j-th  coordinate  axis,  where  j  is 
-<  >  cyclically  from  the  N  space  directions  as: 

j  a  1  r  mod(m.N)  (4.1 ) 

and  tnod(m,N)  denotes  the  remainder  ot  the  quotient  ot  m  over  N. 
Mence  (fiw,  ftw)  and  (Car.  Slw)  are  obtained  as; 

cw  ca'.  <V  -  <V  *«r  i  *  j  and  ck,i  =  okl,  dk|i  *  \  (c*!  ♦  d*l) 

<4.2a> 


cyi  =  7  (cki  +  dki),  dk|i  =  dki 
(4.2b) 


This  correlation  between  nodes  and  subdivisions  ol  the  unit 
hypercube  allows  an  ADT  to  be  further  delined  by  imposing  that  each 
point  in  the  tree  should  lie  inside  tho  region  corresponding  to  the 
node  where  It  is  stored.  Consequently.  II  node  k  ol  an  ADT  structure 
contains  a  point  with  coordinates  a*,  the  following  condition  must  he 
satisfied: 


ck*  £  xk*  <  tfh*  for  i  =  1,2  n  (4.3) 

Due  to  this  additional  requirement  there  exists  only  one  possible  way 
in  which  a  new  point  can  be  inserted  in  the  tree.  As  discussed  in  the 
previous  section  the  tree  is  followed  downwards  until  an  unfilled 
position  where  the  node  can  be  placed  is  found.  During  this  process, 
however,  left  or  right  branches  are  now  chosen  according  to 
whether  the  new  point  lies  inside  the  region  related  to  the  loll  or 
right  sons,  thereby  ensuring  that  condition  (4.3)  is  satisfied. 

Given  a  predetermined  set  ot  n  points,  an  ADT  structure  can  be  built 
by  placing  anyone  point  at  the  root  end  then  inserting  the  remaining 
points  in  consecutive  order  according  to  the  algorithm  described 
above.  This  is  illustrated  in  tigure  4.5  for  a  set  ot  5  points 
(A,8,C,D,E}.  The  shape  ol  the  troo  obtained  in  this  way  depends 
mainly  on  the  spatial  distribution  of  the  points  and  somewhat  on  the 
order  in  which  the  points  wore  inserted.  The  cost  ol  operations  like 
node  insertion/deletion  and  geometric  searching  depends  strongly  on 
the  shape  ot  the  tree;  generally  poor  performances  are  to  be 
expected  Irom  highly  degenerated  trees  (see  figure  4.6).  whereas 
well  balanced  trees  (see  figure  4.7),  as  those  obtained  tor  tairly 
uniform  distributions  ot  points,  will  result  in  substantial  reductions 
of  the  searching  cost.  In  these  cases  the  average  number  of  levels  In 
the  tree,  and  theretoro  the  average  cost  ot  inserting  a  new  point, 
becomes  proportional  to  log(n):  clearly  a  considerable  cost  it 
compared  with  the  cost  ot  storing  the  poir  a  sequential  list,  but 
fully  justifiable  in  view  ot  the  reduction  in  ching  costs  that  ADT 
structures  will  provide. 


Figure  4.9 

Buikfing  an  ADT  by  suceesstv*  Irrsartton 


Geometric  Searching 

Consider  now  a  set  ol  points  stored  in  an  ADT  structure.  The  tact 
that  condition  (4.3)  is  satisfied  by  every  point  provides  tho  key  to 
the  efficient  solution  ol  a  geometric  searching  problem.  To  illustrate 
this,  note  first  that  the  recursive  structure  of  the  bisection  process 
described  above  implies  that  the  region  related  to  a  given  node  k 
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consequently,  all  points  stored  ir  these  nodes  must  also  lie  inside  the 
region  represented  by  nods  k.  For  Instance,  all  points  in  the  ADT 
structure  are  stored  in  nodes  descended  fiom  th>-  i  >ot  and,  clearly, 
all  of  them  lie  inside  the  unit  hypercubo  -  tho  region  associated  with 
the  loot.  Analogously,  the  complete  sot  of  points  stored  in  any 
subtree  is  inside  the  region  represented  by  the  root  of  the  subtree 

This  feature  can  be  effectively  used  to  reduce  the  cost  of  a 
geometric  searching  process  by  checking,  at  any  node  k,  the 
intersection  between  the  searching  range  (a.  b)  and  the  region 
represented  by  node  k,  namely  (&,  six).  It  these  two  regions  tail  to 
overlap,  then  the  complete  set  ot  points  stored  in  tlie  subtree  rooted 
at  k  can  be  disregarded  from  the  search,  thus  avoiding  the  need  to 
examine  the  coordinates  ot  every  sing!>  point. 


Figur*  4.6 


Flflur#  4.7 
A  WiFi  balanced  Ire* 


Consequently,  a  systematic  procedure  to  select  the  points  that  lie 
inside  c  given  searching  range  (a,  b!  can  be  derived  tram  the 
traverse!  algorithm  previously  presented.  Now  the  generic  operation 
'visit  ihe  root'  can  be  re-interpreted  as  checking  whether  the  point 
stored  in  the  root  tails  inside  the  searching  range.  Additionally,  the 
left  and  right  subtrees  need  to  be  traversed  only  it  the  regions 
associated  with  thoir  respective  root  nodes  intersect  with  the  range. 
Accordingly,  a  goomolric  searching  algorithm  emerges  in  a 
recursive  form  as: 

1 .  Chock  whother  tho  coordinates  ot  the  node  stored  in  the  root,  say 
X*.  are  inside  (a.  bj  i.e.  check  whether  a1  <  xk*  <  b'  (or  i  =  u  ...  n. 

2.  It  the  Intt  link  ol  the  root  is  not  zero  end  the  region  (g^,  b*|) 

overlaps  with  (a,  b)  '  «■  'I  d»/  £  a'  and  ck|'  s  b1  tor  i  »  t.2  ...  n, 

search  the  left  subtree. 

3.  It  the  right  link  ol  the  root  is  not  zero  and  the  region  (£kr,  bj,r! 

overlaps  with  (a,  bj  i.e.  it  d,,'  i  a1  and  ckr*  s  b1  tor  i  *  t.2  ...  n, 

search  the  right  subtree. 

In  order  to  illustrate  this  process,  consider  the  set  of  pumts  and  the 
searching  range  shown  in  figure  e.Sa  and  (he  corresponding 
alternating  digital  tree  depicted  In  llgure  4.8b.  For  this  simple 
example,  Ihe  algorithm  given  above  results  in  the  following  sequence 
ot  steps: 


Search  the  tree  (A,B,C,D,E,F,G,H|: 

t .  Check  it  a1  S  x*1  s  b1  for  i  *  1 ,2 

2.  Since  dg1  a  a1  and  ca1  5  b1  search  the  tree 
(B.C.D.E): 

2. 1  Check  it  a1  s  x8' s  b1 

2.2.  Since  dc1  a  a1  and  cc'i  b1  search  tho  tree 
(C,E;: 

2.2.1.  Check  if  a'sxc'sb' 

2.2.2.  Skip  (loft  link  is  zero) 

2.2.3.  Skip  (eg1  >  b1) 

2.3.  Skip  (CD2  >  b2) 

3.  Skip  (cF'  >  b1) 

Again  a  'non-reeursive’  implementation  ol  this  algorithm  can  be 
achicvad  using  a  slack  In  a  very'  similar  way  to  that  previously 
described  for  the  traversal  algorithm. 

Note  that,  with  this  technique,  only  the  coordinates  ot  points  A.B  and 
C  are  actually  examined,  the  rest  being  immediately  disregarded  in 
view  of  their  position  in  the  tree.  In  general,  only  those  points 
stored  in  nodes  with  associated  regions  overlapping  (a.b)  will  be 
checked  during  the  searching  process. 
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Flgur*  4.1 

Th#  searching  problem  iit  a2 


4.3  Geometric  Intereection 

Geometrical  intersection  problems  can  be  toumf  in  many  applies  .  ns; 
for  instance,  a  common  problem  that  may  emerge  in  contact 
algorithms  [46],  hidden  line  removal  applications  or  in  the  advancing 
front  grid  generation  algorithm  presented  in  section  3,  is  to 
determine  from  a  set  of  three  noded  triangular  elements  those  which 
intersect  with  a  givon  ,-ne  segmont.  Similar  problems,  involving 
ether  geometrical  objects,  are  encountered  in  a  wide  range  ol 
geometrical  applications.  In  general,  a  geometric  intersection 
problem  consists  ot  finding  from  a  set  of  geometrical  objects  those 
which  intersect  with  a  given  object.  If  every  one-to-one  intersection 
is  investigated,  Ihe  solution  ol  these  problems  can  become  very 
expensive,  especially  when  complex  objects  such  as  curves  or 
surfaces  are  Involved.  Fortunately,  many  cf  these  one-to-one 
intersections  can  be  quickly  discarded  by  means  ol  a  simple 
comparison  between  the  coordinate  limits  of  every  given  pair  of 
objects  For  instance,  a  triai  .gle  with  x-coordinate  varying  from  0.5 
and  0.7  cannot  intersect  with  a  segment  with  x-coo'dinate  ranging 
from  0.  i  to  0.3.  Generally,  the  intersection  between  two  objects  in 
the  N  dimensional  Euclidean  space,  requires  each  of  the  N  pairs  of 
coordinate  ranges  to  overlap.  Consider  for  instance  the  intersection 
problem  between  triangular  facets  and  a  target  straight  line  segment 
in  H5;  then,  it  Sx,ma«)  are  the  coordinate  limits  ot  elemen*  k 

and  (Xo,min,  Xo,m*»)  are  the  lower  and  upper  limits  ol  the  target 
segment  (see  lipure  4.3),  an  important  step  towards  the  solution  of 
a  geometric  intersection  problem  is  to  select  those  which  satisfy  the 
inequality: 


Xk.mln1  S  Xo.m**1 
Kk.ma**  2  Xo^in* 


for  i  =  M  ...  N  (4.4) 


The  cost  ol  checking  condition  (4.4)  lor  every  element  grows 
proportionally  to  n,  and  for  very  numerous  self  may  become 
prohibitive.  This  cost,  however,  ccn  be  substantially  reduced  by 
using  a  simple  device  whereby  Ihe  process  of  selecting  those 


r-2 1 


elements  which  satisfy  condition  (4.4)  can  bs  Interpreted  as  a 
geometric  searching  problem.  Additionally,  since  the  number  ot 
elements  that  satisfy  condition  (4.4)  will  normally  be  much  smaller 
than  n,  the  cost  of  determining  which  of  these  inteisects  with  the 
target  segment  becomes  affordable. 


Using  this  representation  of  a  given  ooject  k,  condition  (4.S) 
becomes  simply: 


a'SXK'sb1  fori  =i,2...  jn  (4.7) 


Flgur«  4.9 

Thu  definition  ol  coordinate  limits  for  triangular  elements  and  straight  line 
segments. 


where  a  and  fe  can  be  interpreted  as  the  lower  and  upper  vertices  ot 
a  hyper-rectangular'  region  in  R2N  and,  recalling  (4.5),  their 
components  can  be  obtained  in  terms  of  the  coordinate  limits  ot  the 
target  object  (see  figure  4.11)  as: 

a  a  [  0,  .  .  .0,  Xo.rn.-ix1 »  •  ■  ■  xo,s,V  (A. 8a) 

kh  =  [  ko.mln1-  ■  •  •  x0.mtn^»  1,  ...  1)  (4.8b) 


In  order  to  interpret  condition  (4  4)  as  a  geometric  searching 
problem,  it  is  first  convenient  to  assume  that  all  the  elements  to  be 
considered  lie  inside  a  unit  nyporcjbo  -  a  requirement  that  can  be 
easily  satisfied  through  adequate  scaling  ot  the  coordinate  values. 
Consequently,  condition  (4,4)  can  be  re-written  as: 


0  5  Xx.mln1  -  Xo.max1 

C  S  Xx.mlnN  Ko.mnx1'1 
x0.min1  -  xX.inax1  ^  ^ 


(4.5) 


Figure  4.11 

Th#  int*fS4ction  picbtum  In  d1  as  a  Marching  problem  In 


Consequently,  the  problem  ot  finding  which  objects  in  RN  satisly 
condition  (4.4)  becomes  equivalent  to  a  geometric  searching  problem 
in  R2N  i.e.  obtaining  the  points  s*  which  lie  inside  the  region  limited 
by  a  and  St-  Once  this  subgroup  ot  elements  has  been  selected,  the 
intersection  of  each  one  ol  them  with  the  target  object  must  be 
checked  to  complete  the  solution  of  the  geometric  intersection 
problem. 


x0.mln^  -  kX.nnx1'1  -  ^ 

Consider  now  a  given  object  k  in  RN  with  coordinate  limits  Jiu.mm. 
and  ju.max:  combining  this  two  sets  ol  coordinate  valuos,  it  is 
possible  to  view  an  object  k  in  R*1  as  a  point  in  R2N  with  coordinates 
Xx:lor  i  =  i.s  2N  defined  as  (see  figure  4. to): 

Ak-  [  kH.reln1,  -  ■  •  kt-.nrin^i  Xx.max1-  •  ■  *  Hk.max^l  (A. 6) 


Figure  4.10 

The  reeretectition  ot  e  rtgkxi  in  A1  44  4  point  In  R2 


4.4.  The  Uae  ol  the  AOT  tor  Grid  Generation 

It  is  obvious  Irom  the  advancing  front  algorithm  described  in  section 
3  that  operations  such  as  searching  tor  the  points  inside  a  certain 
region  ot  the  space  and  determining  intersections  between 
geometrical  objects  •  in  this  case  sides  and  tacoc  -  will  be 
performed  very  frequently.  The  complexity  ot  tho  problem  is 
increased  by  the  fact  that  the  sei  ol  laces  forming  the  generation 
front  changes  continuously  as  new  laces  need  to  be  inserted  and 
deleted  during  the  process.  Clearly,  lor  grids  consisting  ol  a  large 
number  ot  elements  the  cost  ol  performing  this  operations  can  be 
very  Important. 

A  successful  implementation  ot  the  above  algorithms  has  been 
accomplished  by  making  extensive  use  of  the  ADT  data  structure. 
For  instance,  the  algorithm  of  section  3.5  tor  tetrahedra  generation 
employs  two  tree  structures:  one  for  in=  tacos  in  the  Iron!  and  the 
other  tor  the  s.dBS  defined  by  tho  intersection  between  each  pair  of 
laces  in  the  front  (see  figure  3  11).  This  combination  allows  a  high 
degree  of  flexibility  and  the  operations  ot  insertion,  deletion, 
geometric  searching  and  geometric  intersection  can  be  performed 
optimally.  The  overall  computational  perloimance  of  the  algorithm  is 
demonstrated  by  generating  tetrahedral  grids,  using  the  above 
method,  for  a  unit  cube  (see  figure  4.12).  Different  numbers  of 
elements  have  been  obtained  by  varying  the  grid  size.  In  figure  4.12 
the  computer  time  required  on  a  VAX  8700  machine  has  been  plotted 
against  the  number  NE  ot  elements  generated.  It  can  be  observed  that 
a  typical  NE’log(NE)  behaviour  Is  attained.  Using  this  approach  grids 
containing  up  to  one  million  elements  have  been  generated  and  no 
dogradation  In  the  performance  has  been  detected. 
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Figure  4.12 

Grid  genefalior  epu  lime*. 


S.  ADAPTIVITY  FOR  STEADY  STATE  PROBLEMS 

The  procedures  described  above  allow  lor  the  computation  ol  an 
initial  approximation  to  the  steady  state  solution  ol  a  given  problem. 
Ihis  approximation  can  generally  be  improved  by  adapting  the  grid  in 
some  manner.  Here,  we  lollow  the  approach  ol  using  the  computed 
solution  to  predict  the  desired  characteristics  (i.  a.  element  size  and 
shape)  lor  a  new,  adapted  grid.  The  ultimate  aim  ol  the  adaptation 
procedure  is  to  predict  Ihe  characteristics  ol  the  optimal  grid.  This 
can  be  defined  as  Ihe  grid  in  which  the  number  ol  degrees  ol 
freedom  required  to  achieve  a  specified  level  ol  accuracy  is  a 
minimum.  Alternatively,  it  can  be  interpreted  as  the  grid  in  which  a 
given  number  of  degrees  ol  freedom  are  distributed  in  such  a  manner 
that  the  highest  possible  solution  accuracy  is  achieved.  In  practical 
situations  however,  there  are  several  factors  which  make  the 
achievement  ol  such  optimal  grids  extremely  difficult  Some  of  these 
(actors  are: 

i)  The  concept  ot  optimality  is  intimately  linked  to  that  of  accuracy, 
which  is  not  uniquely  defined.  Hence  optimality  ol  a  grid  needs  to  be 
defined  with  respect  to  a  given  norm  or  measure  ol  the  error,  An 
additional  inconvenience  relatod  to  the  measure  of  accuracy,  in  the 
present  context,  arises  from  the  tact  that  we  are  attempting  to 
solve  a  coupled  set  at  non  linear  partial  differential  equations  and, 
therefore,  a  rigourous  measure  ot  the  error  should  involve  all  the 
relevant  variables. 

ii)  For  linear  elliptic  operators,  as  we  have  shown  in  Section  1, 
Gaierkin  finite  element  algorithms  are  readily  derived  which 
guarantee  that  the  approximation  obtained  is  the  most  accurate 
amongst  ol!  the  possible  approximations  within  the  trial  space  ot 
functions.  Heie.  accuracy  is  defined  with  respect  to  a  norm  implied 
by  the  operator  itsell  (the  energy  norm).  For  the  Euler  equations, 
however,  such  an  energy  norm  does  not  exist  and  no  numerical 
schemes  are  known  which  possess  this  optimality  property. 

iii)  This  best  approximation  property  means  that  the  error  ol  the 
computed  solution,  measured  in  the  energy  norm,  is  bounded  above 
by  that  ol  the  exact  interpolant.  I.e.  the  approximation  in  the  space 
ol  current  trial  functions  which  has  exact  nodal  values.  Using  results 
ot  interpolation  theory  (47),  it  is  then  possible  to  produco  rigourous 
bounds  on  the  error  ot  Ihe  numerical  approximation.  These  results 
are  tased  on  certain  regularity  assumptions  on  «■-»  solution,  which 
lor  the  Euler  equations  will  be  inval  ",c  viemity  ot 
discontinuities  in  the  tlow. 

iv)  Finally,  the  orror  estimates  produced  are  based  on  the  computed 
solution.  As  this  is  only  an  approximate  solution,  such  error 
estimates  will  only  be  as  good  as  the  computed  solution.  This  means 
that,  even  in  tha  best  situation,  the  optimal  grid  will  only  be 
achieved  in  the  asymptotic  limit,  i.e.  when  the  solution  is  so  gooo 
that  the  computed  error  becomes  very  reliable. 


In  view  ot  these  observations  and  limitations,  we  have  mado  an 
attempt  to  develop  a  heuristic  adaptive  strategy.  This  strategy  uses 
error  estimates  which  are  based  upon  concepts  from  interpolation 


theory.  The  possible  presence  of  discontinuities  in  the  cotution  is 
taken  into  account  and,  in  addition,  the  procedure  provides 
information  about  any  directionality  which  may  be  present  in  the 
solution.  The  advanteges  ot  using  directional  error  indicetore  become 
apparent  when  we  consider  the  nature  of  the  solutions  to  be 
computed  involving  flows  with  shocks,  contact  discontinuities  etc. 
Such  features  can  be  most  economically  represented  on  grids  which 
ore  stretched  in  appropriate  directions.  Although,  these  error 
estimates  have  no  associated  mathematical  rigour  considerable 
success  has  been  achieved  with  their  use  in  practical  situations. 

The  computed  error,  estimated  Irom  the  current  solution,  is 
transformed  Into  a  spatial  distribution  of  'optimal'  grid  spaclngs 
which  are  interpolated  using  the  current  grid.  The  current  grid  is 
then  modified  with  the  objective  ot  meeting  these  'optimal' 
distribution  ol  grid  characteristics  as  closely  as  possible.  Three 
alternative  procedures  will  be  discussed  here  (or  performing  the 
grid  adaption.  The  resulting  grid  Is  employed  to  produce  a  new 
solution  and  Ihis  procedure  can  repeated  several  times  until  the  user 
is  satisfied  with  the  quality  ol  the  computed  aolution. 

5.1  Error  Indicator  In  ID 

The  development  ot  a  method  tor  error  indication  is  considerably 
simplified  if  we  restrict  consideration  to  problems  involving  a  single 
scalar  variable.  For  this  reason,  when  solving  the  Euler  equations,  a 
key  variable  is  identified  and  then  the  grid  adaptation  is  based  on  an 
error  analysis  for  that  variable  alone  The  choice  ot  the  best 
variable  to  use  as  a  key  variable  remains  an  open  question,  but  the 
the  Mach  number  has  been  adopted  tor  the  computations  reported  in 
these  notes. 

Consider  first  the  one  dimensional  situation  in  which  the  exact 
values  of  the  key  variable  a  are  approximated  by  a  piecewise  linear 
function  c.  The  error  E  is  then  defined  as 


E  *  o(x')  -  d(x’)  (5.1) 

We  note  here  that  it  the  exact  solution  is  a  linear  function  ot  x1  then 
the  error  will  vanish.  This  is  because  our  approximation  has  been 
obtained  using  piecewise  linear  Unite  element  shape  (unctions. 
Moreover,  if  the  exact  solution  is  not  linear,  but  is  smooth,  then  it 
can  be  represented,  to  any  order  ot  prscision,  using  polynomial 
shape  functions. 


To  a  lirst  order  of  approximation,  the  error  E  can  be  evaluated  as 
the  difference  between  a  quadratic  tinite  eloment  solution  <5  and  Ihe 
linear  computed  solution.  To  obtain  a  piecewise  quadratic 
approximation  one  could  obviously  solve  a  new  problem  using 
quadratic  shape  functions.  This  procedure  however,  although 
possible,  is  not  advisable  as  it  would  be  even  more  costly  than  the 
original  computation.  An  alternative  approach  lor  estimating  a 
quadratic  approximation  from  the  linear  tinite  element  solution  is 
therefore  employed.  Assuming  that  the  node!  values  ot  the  quadratic 
and  linear  approximations  coincide  i.e.  the  nodal  values  ot  E  are 
zero,  a  quadratic  solution  can  be  constructed  on  each  element,  once 
the  value  of  the  second  derivative  Is  known.  Thus  the  variation  of 
the  error  E  within  an  element  e  can  be  expressed  as 

1  | 

E.»2  C(h.-C)  ^1  (5.2) 


where  £  denotes  a  local  element  coordinate  and  h.  denotes  the 
element  length.  A  procedure  for  estimating  tha  second  derivative  of 
a  piecewise  linear  function  is  described  below. 


The  root  mean  square  value  E»RMS  ct  this  error  over  the  element  can 
be  computed  as 


V 120 


(5.3) 


where  j  .  |  stands  for  absolute  value. 

We  define  the  'optimal'  grid,  for  a  given  degree  of  accuracy,  as  the 
grid  in  which  this  root  mean  square  error  is  equal  over  each 
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element.  In  the  present  context,  this  requirement  may  be  regarded 
as  being  somewhat  arbitrary.  However,  it  has  been  shown  (48]  that 
the  requirement  ol  equidistribu'.lor  of  the  error  leads  to  optimal 
results  when  applied  to  certain  elliptic  problems.  This  requirement  is 
therefore  written  as 


he* 


dx>2 


*  c 


(5.4) 


where  C  denotes  a  positive  constant. 

Finally,  the  requirement  ol  equation  (5.4)  suggests  that  the  'optimal' 
spacing  {  on  the  new  adapted  grid  should  be  computed  according  to 


S2 


=  C 


(5.5) 


where  Q.  is  on  arbitrary  unit  vector,  Sp  is  the  spacing  along  the 
direction  ol  ft,  and  m'i  are  the  components  ot  a  NxN  symmetric 
matrix  ot  second  derivatives 
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These  derivatives  ere  computed,  at  each  node  ot  the  current  grid,  by 
using  th9  N  dimensional  equivalent  ol  the  procedure  presented  In  the 
previous  section.  The  meaning  ot  equation  (S.9)  is  graphically 
illustrated  in  figure  5.1  Which  shows  how  the  value  of  the  spacing  in 
the  ft  direction  can  be  obtained  as  the  distance  trom  the  origin  to  the 
point  ol  Intersection  ot  the  vector  ft  with  the  surface  ot  an  ellipsoid. 
The  directions  and  lengths  of  the  axes  of  the  ellipsoid  are  the 
principal  directions  and  eigenvalues  ot  the  matrix  m  respectively. 


S.2  Recovery  ot  tho  Second  Derivatives 
The  first  derivative  of  the  computed  solution  on  a  grid  ot  linear 
elements  will  bG  piecewise  constant  and  discontinuous  across 
elements.  Therefore,  straightforward  differentiation  of  a  leads  to  a 
second  derivative  which  is  zero  inside  each  element  and  is  not 
defined  at  the  nodes.  However,  by  using  a  recovery  process,  based 
upon  a  variational  or  weighted  residual  statement  (12),  it  is  possible 
to  compute  nodal  values  ot  the  second  derivatives  trom  element 
values  ol  the  first  derivatives  ol  b 


To  illustrate  this  process,  consider  a  one  dimensional  domain  0  <  x1 
<  L  which  has  been  discretised  into  (n-1)  linear  two  coded  (Lite 
elements.  The  piecewise  linear  distribution  ol  the  computed  solution 
d  is  expressed  as 


A-ENjbj  (5.6) 

where  Njis  the  standard  linear  finite  element  shape  (unction  (12] 
associated  to  node  j.  Similarly,  a  piecewise  linear  approximation  to 
the  distribution  of  the  second  derivative,  which  we  seek  to 
determine,  can  be  written  as 
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(5.7) 


The  nodal  values  of  the  second  derivative  may  be  compared  trom  the 
approximate  variational  requirement  that 

"  f  (ft»  |  f  "  dN,  dN* 

jE  ( jNjNK«m)Sns|  -  -  J(jLt 

n  j  {1 

-  (^Nk),Uo  ?  <0nk>xU  K«  1,  ...n  (5.8) 

The  values  ol  the  derivatives  at  the  two  end  points  can  bo  inserted, 
it  known,  or  can  be  taken  to  be  equal  to  the  constant  value  ol  the 
derivative  in  the  adjacent  elements.  The  resulting  set  ol  algebraic 
equations  can  be  solved,  in  a  few  iterations,  by  using  a  Jacobi 
procedure  (16)  or  alternatively,  the  cons  stent  mass  matrix 
appearing  on  the  left  hand  side  of  equation  (5.7)  can  be  lumped,  thus 
yielding  a  diagonal  system  ot  equations.  Numerical  results  obtained 
to  date  do  not  indicate  any  significant  differences  in  tho  grids 
produced  by  using  these  two  approaches. 

5.3  Extension  to  Multl-Dlmenalcno 

Following  the  process  described  above,  nodal  values  ot  the  second 
derivative  can  be  obtained  trom  the  approximate  solution  on  the 
current  grid.  The  use  ol  expression  (5.5)  then  yielos  directly  a  nodal 
value  ol  the  'optimal'  spacing  tor  the  new  gud. 

Expression  (5.5)  can  be  directly  extended  to  the  N  dimensional  case 
by  writing  the  quadratic  form 


Several  alternative  procedures  exist  for  modifying  an  existing  grid 
in  such  a  way  that  tho  requirement  expressed  by  equation  (5.8)  is 
more  closely  satisfied.  Three  such  methods  will  be  described  here.  In 
the  first  procedure,  called  grid  enrichment,  the  nodes  of  the  current 
grid  are  kept  lixed  but  some  new  nodes/elements  ate  created.  In  the 
second  procedure,  referred  to  as  grid  movement,  the  total  number 
of  elements  and  nodes  remains  fixed  but  their  position  is  altered. 
Finally,  in  tho  adaptive  regriding  algorithm,  the  grid  adaption  is 
accomplished  by  completely  regenerating  a  new  grid  using  the  grid 
generation  algorithm  presented  In  section  3. 


Figure  $.1 

Th«  d»twmlnaUon  ©J  Iho  value  ol  the  spacing  5  along  tlu»  direction  ft. 


5.4  Grid  Enrichment 

In  order  to  adapt  a  grid  using  grid  enrichment,  a  sweep  over  all  the 
sides  in  the  g.kf  is  made  and  the  'optimal'  spacing  in  the  direction  ot 
each  side  is  computed  according  to  expression  (5.9).  For  each  side, 
the  matrix  m  is  taken  t  >  be  the  average  ol  its  value  at  the  two  nodes 
of  the  side.  The  enrichment  procedure  consists  of  introducing  an 
additional  node  tor  each  side  lor  which  the  calculated  spacing  is  less 
than  the  length  ot  the  side.  For  interior  sides,  this  additional  node  is 
placed  at  the  mid-point  of  the  side,  whereas  tor  boundary  sides,  it  is 
necessary  to  refer  to  the  boundary  definition  and  to  ensure  that  the 
new  node  is  placed  on  the  true  boundary.  When  any  skki  is  subdivided 
n  this  manner,  the  element-  associated  with  that  side  will  also  need 
to  be  subdivided  in  order  to  preserve  the  consistency  ot  the  find 
grid.  Figure  5.2  illustrates  the  three  possible  ways  in  which  this 
element  subdivision  might  have  to  be  performed  In  two  dimensions. 
The  number  of  sides  to  be  refined  depends  on  th6  choice  of  the 
constant  C  in  equation  5.9.  To  avoid  excessive  refinement  in  the 
vicinity  of  discontinuities,  a  minimum  threshold  value  tor  the 
computed  spacing  can  be  used.  When  the  grid  enrichment  procedure 
has  been  completed,  the  values  ol  the  unknowns  at  the  new  nodes  are 
linearly  interpolated  trom  the  original  grid  and  the  solution  algorithm 
is  re-started.  This  procedure  has  been  successfully  implemented  in 
two  and  three  dimensions  and  several  impressive  demonstrations  ol 
the  power  of  this  technique  have  been  made.  (8,19,49,50). 
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tl  can  be  observed,  from  the  examples  presented,  how  the  quality  ot 
the  solution  is  significantly  improved  by  the  application  ot  the 
enrichment  procedure.  The  main  drawback  ol  the  approach  is  that  the 
number  ot  elements  increases  considerably  following  each 
application  ot  the  procedure  This  means  that,  in  the  simulation  of 
practical  three  dimensional  problems,  only  a  small  number  ol  such 
adaptalions  can  be  contemplated. 


Figure  6.2 

Ttie  g(W  •michmunt  ptocos*  .nouing  three  ;«MbU  retirement  caeee  l«  e 


The  application  of  the  enrichment  procedure  in  the  solution  ol  a  two 
dimensional  example  is  illustrated  in  figure  5.3.  The  problem  solved 
Is  a  Mach  0.1 5  (low  past  a  double  ellipse  conliguralion  at  30°  angle 
ot  attack.  The  initial  grid  and  two  adaptively  enriched  grids  are 
shown  together  with  the  computed  Mach  number  solutions.  The 
application  of  tho  cnrichmenl  algorithm  in  three  dimensions  is  shown 
in  figure  S.4.  The  inviscid  (low  past  a  30°  wedge  is  solved.  The  tree 
stream  Mach  number  is  3.  This  is  a  two  dimensional  problem 
computed  on  a  throe  dimensional  grid.  Two  views  of  the  initial  grid 
and  solution  are  shown.  A  single  application  ol  the  enrichment 
algorithm  produces  the  grid  and  solution  which  are  also  displayed  in 
ligure  5.4. 


Figure  5.4 

Supersonic  flow  past  a  weope  of  angle  thirty  in  three  dimension*, 

(a)  (blVlewa  ot  the  Initial  grid,  (c)  The  computed  density  contours. 

(d)  (e)  Views  ol  the  enriched  grid,  tl)  The  computed  density 
contours  on  the  enriched  grid. 


5.5  Grid  Movement 

For  the  grid  movement  algorithm,  the  element  sides  are  considered 
as  springs  of  prescribed  stiffness  and  the  nodes  are  moved  until  the 
spring  system  Is  in  equilibrium.  Consider  two  adjacent  nodes  j  and  k 
as  shown  in  ligure  5.5.  The  force  Ijk  exerted  by  the  spring 
connecting  these  two  nodes  can  be  taken  lo  be 

tm  *  Cjk  ( U  •  iK  )  (5.1 1 ) 

where  Cjk  is  the  stiffness  of  the  spring  and  ij  and  are  the 
position  vectors  of  nodes  J  and  k  respectively.  Assuming  that 


h-  lu  -Ik  I  (5.12) 

the  adaptation  requirement  of  equation  (5.9)  wilt  be  satislied  if  the 
spring  stillnesses  ere  defined  as 
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SupWMrie  How  put  >  doubt*  Up.  a  ■  tree  omen  Mach  number  ot  a. 15  enS  el  en 
angle  ot  cue*  ot  thirty  Segreee  thowing  •  segue nee  at  ptW  end  eoKKton*  obtwned 
ton owing  the  tee  ot  ndopIVe  enrichment, 


Here  djk  is  the  unit  vector  in  the  direction  of  the  side  joining  nodes  j 
and  k.  For  equilibrium,  the  eum  of  spring  forces  tt  each  node  should 
be  equal  to  zrro.  The  assembled  system  can  be  brought  into 
equilibrium  by  simple  Iteration.  In  each  deration,  a  loop  is  ptrformed 
over  all  the  Interior  nodes  and  new  nodal  coordinates  are  calculated 
according  to  the  expression 
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where  the  summation  extends  over  the  number  ol  nodes.  Sj,  which 
surround  node  j.  Sufficient  convergence  is  normally  achieved  after 
three  to  five  passes  through  this  procedure. 


This  technique  will  not  necessarily  produce  grids  of  belter  quality, 
as  badly  formed  elements  can  appear  in  regions  (such  as  shocks)  in 
which  the  spring  coefficients  vary  rapidly  over  a  short  distance. 
To  avoid  this  problem,  tho  definition  ol  the  value  ol  CJK  given  in 
equation  is  (5.13)  can  be  replaced  by  an  expression  of  the  form 

(5-,5) 

This  can  be  regarded  as  a  blending  function  definition  for  (he  spring 
stiffnesses  and  it  has  been  constructed  so  as  to  ensuro  that,  with  a 
suitable  choice  for  the  constants  A  and  B,  excessively  small  or 
excessively  large  element  sizes  are  avoided.  This,  in  turn  moans 
that  grids  of  acceptable  quality  will  be  produced.  More  sophisticated 
procedures  for  controlling  the  quality  of  the  grid  during  movement 
can  also  be  devised  [51]  and  grid  movement  algorithms  have  been 
successfully  used  in  two  and  three  dimensional  flow  simulations  on 
both  structured  and  unstructured  grids  [15,51,52]. 

The  grid  movement  algorithm  described  has  been  applied  to  the 
problem  at  Mow  past  a  double  ellipse  configuration  which  has  been 
treated  previously.  Figure  5.6  shows  the  solutions  produced 
following  two  grid  adaptations.  It  can  be  seen  that  the  Improvement 
obtained  after  the  second  adaptation  is  minor.  This  is  becauss  the 
algorithm  does  not  allow  tor  the  creation  of  new  nodes  and  so  the 
quality  of  the  final  solution  is  very  much  dependent  on  the  topology 
of  the  initial  grid.  This  is  a  major  drawback  of  the  grid  movement 
strategy.  A  possible  remedy  to  this  problem  is  to  combine  grid 
enrichment  and  grid  movement  procedures.  This  is  demonstrated  in 
figure  5.7  which  shows  the  application  ol  fhe  movement  procedure  to 
the  final  enriched  grid  of  figure  5.3. 

5.6  Adaptive  Regrldlng 

The  basic  idea  of  the  adaptive  regridlng  technique  is  to  use  the 
computed  solution  to  provide  information  on  the  spatial  distribution 
of  the  grid  parameters.  This  information  will  be  used  by  the  grid 
generator  described  in  section  3  to  generate  a  completely  new 
adapted  grid  for  the  problem  under  investigation. 
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Figure  5.6 

Supersonic,  fbw  past  a  double  efilp«.o  m  *  tree  stream  Mach  number  of  tt.15  and  at  an 
angle  ot  attack  of  thirty  degrees  showing  a  sequence  <A  yiiJs  and  solutions  obtained 
following  the  use  ol  adaptive  grid  movement 


Figure  57 

Supersonic  (low  past  a  double  skipas  at  a  free  stream  Mach  number  ot  0.1$  and  at  an 
angle  of  attack  of  thirty  degrees  showing  the  sotutiene  obtained  following  fhe 
application  of  adaptive  grid  movement  to  a  previously  enriched  grid. 


The  'optimal'  values  for  the  grid  parameters  are  calculated  at  each 

node  of  the  current  grid.  The  directions  oi:  i-i . n,  are  taken  to  be 

the  principal  directions  ot  the  matrix  m .  The  corresponding  grid 
spacings  are  computed  from  the  eigenvalues  e*;  i-i . n,  as 

f  Cl  *  ' 

®l  «|~j  tor  i-t . N  (5.16) 


5-32 


The  spatial  distribution  ol  the  grid  parameters  Is  defined  when  a 
value  is  specified  for  the  constant  C.  The  total  number  of  elements  in 
the  adapted  grid  will  depend  upon  the  choice  of  this  constant,  Tor 
smooth  regions  of  the  flow,  this  constant  will  determine  the  value  of 
the  root  mean  square  error  in  the  hey  variable  that  we  are  willing  to 
accept.  Therefore  this  constant  should  be  decreased  each  time  a  new 
grid  adaption  Is  performed.  On  the  other  hand,  solutions  of  tha  Euler 
equations  are  known  to  exhibit  discontinuities.  At  such 
discontinuities,  the  root  mean  square  error  will  always  remain  large 
and  therefore  a  different  strategy  is  needed  in  the  vicinity  of  such 
features. 

In  the  practical  implementation  of  tho  present  method,  two  threshold 
values  for  the  computed  spatial  distribution  ol  spacing  are  used:  a 
minimum  spacing  and  a  maximum  spacing  $maxi  sc  that 

Smln  s  *1  S  Smu  for  i«1 . N  (5.17) 

Tne  reason  for  delining  the  maximum  value  Is  to  account  for  Ihe 

possibility  of  a  vanishing  eigenvalue  in  (5.16)  which  would  render 
that  expression  meaningless.  The  value  ol  8mal  is  chosen  as  Ihe 
spacing  which  will  be  used  in  the  regions  where  Ihe  How  Is  uniform 
(Ihe  far  field,  for  instance).  On  Ihe  other  hand,  maximum  values  of 
the  second  derivatives  occur  near  tire  discontinuities  (it  anv)  ol  the 
flow  where  the  error  indicator  will  demand  thal  smaller  ©le'wna 
are  required.  By  imposing  a  minimum  value  Smln  lor  the  grid  size,  wa 
attempt  to  avoid  an  excessive  concentration  ol  elements  near 
discontinuities.  As  the  (low  algorithm  is  known  to  spread 
discontinuities  over  a  fixed  number  ot  elements  (i.e.  two  or  three), 
6mi„  is  therefore  set  to  a  value  that  is  considered  appropriate  to 
ensure  that  discontinuities  are  represented  to  a  required  accuracy. 
This  treatment  also  accounts  lor  the  presence  ol  shocks  of  different 
strength  in  which,  since  the  numerical  values  of  the  second 
derivative  are  dilferent,  equation  (S.16)  will  assign  them  different 
grid  spacing;  (e  g.  larger  spacings  in  the  vicinity  of  weaker  shocks). 

The  total  number  ol  elements  generated  In  the  new  grid  will  now 
depend  on  the  values  selected  for  C,  Sna>,  and  Smin.  However,  it 
turns  out  that  this  number  is  mainly  determined  by  the  choice  at  the 
constant  C,  which  is  somehow  arbitrary.  The  criterion  employed 
here  is  to  select  a  value  that  produces  a  computationally  affordable 
number  of  elements. 

The  adaptive  regriding  strategy  presented  in  this  section  is 
illustrated  in  figure  5.8  by  showing  the  various  stages  during  the 
adaptation  process.  Figure  5.8(a)  shows  Ihe  initial  grid  employed  tor 
the  computation  of  the  supersonic  flow  past  a  double  ellipse 
configuration.  The  Mach  number  contours  ot  the  solution  obtained  on 
the  initial  grid  are  shewn  in  figure  5.8(b).  Tho  flow  conditions  are  e 
Iree  stream  Mach  number  of  8.1S  and  an  angle  of  attack  of  30°.  Tho 
application  ot  expression  5.16  to  the  solution  obtained  produces  Ihe 
distribution  ot  spacing  and  stretching  displayed  in  figures  5.8(c)  and 
5.8(d)  respectively  In  ligure  5.8(c)  the  contours  corresponding  to 
Ihe  valuo  ot  th9  minimum  spacing  occuring  in  any  direction  is  shown, 
whereas  in  figure  5.8(d)  the  valuo  ol  and  the  direction  of  stretching 
is  displayed  in  the  form  ot  a  vector  tietd.  The  magnitude  ot  the 
vector  represents  the  amount  ot  stretching  i.e.  ratio  between 
maximum  end  minimum  spacings.  and  the  direction  ot  the  vector 
indicates  the  direction  along  which  tho  spacing  is  maximum.  In  this 
oxample  expression  5.17  has  been  applied  to  the  computed  spacings 
with  values  ot  8m„  »  15  and  8min  »  0.9.  Figures  5.8(e)  •  5.8(h) 
show  various  stages  during  the  regeneration  process.  It  can  be 
observed  how  small  elements  are  generated  first  as  discussed  in 
section  3.5.  The  completed  grid  is  shown  in  ligure  5.8(i)  and  the 
solution  computed  on  this  adapted  grid  is  shown  in  figure  5.8(j).  It 
can  be  observed  how  a  very  significant  improvemeni  in  tho  solution 
is  obtained  using,  in  this  case,  a  single  adaptation. 

Estimating  the  Number  el  Elements  to  be  Generated 

The  regeneration  process  uses  the  current  grid  as  the  background 
grid.  Such  a  background  grid  clearly  represenls  accurately  the 
geometry  ot  the  computational  domain.  In  this  case,  the  number  ot 
oloments  to  be  generated,  denoted  by  N„  can  be  estimated  as 
follows.  Once  the  values  ol  C,  and  have  been  selected,  the 

spatial  distribution  ol  grid  parameters  8|,  tp;  i*i . n  is  computed. 

For  each  element  of  the  background  grid,  the  values  ot  the 
transformation  T  is  computed  at  the  centroid.  The  transformation  is 
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Figure  9.$ 

lltMlratfon  of  th*  ad^tlve  regrldlrvg  procedure  applied  In  the  analysis  ol  supersonic 
How  pert  a  double  otMpaa  at  a  fra*  etreem  Mach  number  ol  8.15  and  at  an  angle  ©* 
attach  ol  thirty  degree*  .  (a)  Initial  grid,  (b)  Mach  number  contours  computed  on  the 
MtW  grid.  <c)  The  compiled  distribriion  of  spacings,  using  6m  In  »  9  and  5max  ■  IS. 
<d)  The  computed  values  end  directum  lor  the  stretching,  (a)  -  (h)  Views  of  the  gild 
during  different  stage*  of  the  regrWrrg,  (i)  Final  adapted  grid  (j)  The  Mach  number 
contour  distribution  produced  on  the  adapfluety  regenerated  grid. 
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applied  to  the  nodes  of  (he  element  and  its  volume  V,  in  the 
normalised  space  Is  computed  The  number  ot  elements  N.  is  Assumed 
to  be  proportional  to  tho  total  volume  in  the  unstretched  space,  i.e. 

H> 

N,-xXV'  (518) 

0a  I 

where  Nb  is  the  number  ol  elements  in  the  background  grid  and  x  is  a 
constant.  The  value  of  %  is  calculated  as  a  statistical  average  of  the 
values  obtained  tor  several  generated  grids.  The  calculated  value  Is 
X  =  9.  This  procedure  gives  estimates  ol  the  value  of  N,  with  an 
error  of  less  than  20%,  which  is  accurate  onough  lor  most  practical 
purposes.  II  the  estimated  value  ol  N,  is  either  too  big  or  tco  small, 
then  the  value  ot  C  is  reduced  or  increased  and  the  process  repeated 
until  the  value  ol  C  produces  a  number  ol  elemenls  which  is  regarded 
as  being  computationally  acceptable. 

Application  Examples 

Doubt#  Ellipse 

The  adaptive  regriding  procedure  is  applied  twice  to  the  problem  ol 
flow  past  a  double  ellipse.  The  flow  conditions  are  those  previously 
considered  (or  this  configuration.  The  initial  and  two  adapted  grids 
and  the  solutions  fcr  Mach  number  are  shown  in  figure  S.9.  The 
characteristics  ol  the  grids  employed  are  displayed  in  table  5.1. 
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Shock  Interaction  on  a  Swept  Cylinder 
This  Is  a  problem  of  practical  Interest  because  Its  Implications  to  the 
design  ot  hypersonic  vehicles  |531.  The  experimental  apparatus  and 
the  computational  domain  adopted  are  shown  dlagramalktally  in 
tigure  S.tO(a).  The  numerical  simulation  has  been  canted  out  tor  a 
sweep  angle  ot  15°  on  a  cylinder  ol  dlemeter  D  equal  to  3  inches  and 
length  L  equal  to  9  ir»che6,  The  undisturbed  tree  stream  Mach  number 
is  8.03.  The  fluid  which  has  heen  turned  by  the  shock  generator 
enters  the  computational  domain  wilh  a  Mach  number  ot  5.20.  The 
initial  grid  and  those  obtained  alter  two  adaptive  regridlngs  and  the 
density  contours  distribution  are  shown  li,  tigure  5.10(b).  The 
characteristics  ot  the  grids  ate  shown  in  table  5.2. 
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Table  5.3  3D  Shock  Interaction  on  a  swept 
cylinder:  grid  characteristics. 

The  pctenual  advantages  of  tire  adaptive  regriding  procedure  are 
clearly  illustrated  in  this  three  dimensional  example.  The  tinal 
adapted  grid  has  a  resolution  ot  more  than  live  times  that  ot  the 
initial  grid  whereas  the  total  number  ol  degrees  ot  freedom 
increases  by  only  a  factor  ol  3.4.  The  elfec's  of  the  three 
dimensional  adaptation  are  best  shown  in  tigure  5.10(c)  which  shows 
the  cross  section  through  the  grids  hall  way  along  the  cylinder.  Two 
views  ol  Ihe  throe  dimensional  grid  lor  the  tinal  adaptation  together 
with  the  solution  obtained  are  shown  in  figures  5.10(d)  and  5.10(e) 
respectively. 
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Table  5.1  Doublo  oil  ipso  (M_=8.15,  n---30°):  grid  characlerislics. 

It  is  observed  how  the  application  ol  the  adaptive  procedure,  wiien 
compared  to  the  enrichment  strategy,  allows  for  a  larger  incroase  in 
me  resolution  at  the  expense  ot  a  smaller  increase  on  total  number 
of  elements.  On  the  other  h„nd  the  regridina  procedure  does  nol 
suiter  Irom  Ihe  limitations  inherent  in  the  grid  movement  algorithm. 
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Figure  6.10 

Shock  interaction  on  a  swept  cylinder  a)  a  Ire#  slrwm  Mach  number  ol  ft  15  and  at 
<iti«*en  degrees  nnglo  ot  swoep  (a)  Stretch  ot  the  experimental  apparatus  and  the 
chosen  comnulal  tonal  domatn.  (b)  The  seqiarnc#  nl  grids  and  oorro  sending  computed 
density  contours  obtained  by  using  adaptive  regodmg.  (c)  Cross  sections  taken 
through  the  3D  gfk*'s  halt  way  along  the  cylinder  (d)  The  third  adapted  grid,  (t)  Detail 
oi  the  shock  Interaction  on  the  final  grid. 


Ganarle  Fighter  Configuration 

This  oxampla  concerns  the  simulation  ol  Ihe  flow  pasl  a  generic 
fighter  configuration  The  generation  ol  Ihe  Initial  grid  for  that 
problem  has  been  described  in  section  3.9.  The  How  condilions 
considered  correspond  to  a  tree  stream  Mach  number  of  2  at  an 
angle  ol  attack  ot  3.79°.  The  engine  inlet  is  modelled  by  prescribing  a 
Mach  number  of  0  3  wilhin  the  engine.  At  Ihe  outlet  supersonic  flow 
conditions  are  assumed.  Boeause  ol  the  symmetry  ot  Ihe  problem 
only  half  of  the  domain  is  modelled.  The  epline  definition  of  the 
goornotry  is  shown  in  figure  5.  11(a)  and  consists  of  23  surface 
components  and  53  curve  components.  Two  grids  have  been 
employed  in  an  initial  demonstration  ot  adaptive  iegriding  applied  to 
full  aircraft  configuration.  Tho  initial  grid  contains  76,522 
tetiahedral  elements  with  4,128  triangular  laces  on  the  boundary.  A 
preliminary  first  solution  was  computed  using  1 ,500  iterations  of 
the  basic  explicit  scheme.  A  second  grid  was  adaptively  generated 
using  the  Mach  number  as  the  key  variable  in  the  error  analysis.  The 
now  grid  is  formed  by  70,125  tetiahedm  with  7,262  triangles  on 


Flgura  5.11 

Oanwic  Hgtrlvr  conifjurallon  «  a  Iim  Flraan  Marti  numtoar  ol  2  am  at  an  angla  o! 
attack  ot  379  degrees.  (a)  Geometry  definition  ■  aircraft  surface  and  outer  boundary 
(Jj)  Initial  grid  and  computed  preesu'e  eotuHon  in  tho  eymmetry  plane  (c)  Second  grid 
and  computed  prete’tfe  solution  on  the  symmetry  plan*.  (d)  Initial  grid  and  compiled 
pressure  toMtoo  on  the  aeroplane  surface,  (a)  Secortd  grid  and  computed  pressure 
solution  on  the  aeroplane  suHace. 


the  boundary.  It  is  inloresting  to  notlco  that  tho  number  cl  elements 
in  the  two  grids  is  approximately  the  same  whereas  Ihe  number  ot 
tacos  on  the  surl&ee  has  increased.  Moreover,  the  minimum  spacing 
on  tho  adeptod  grid  Is  3.5  times  smallot  than  tho  on<>  on  the  Initial 
grid,  thus  indicating  also  an  Incroaso  In  the  grid  resolution.  Tho 
solution  on  the  now  grid  was  obtained  attor  2,000  Iterations.  The 
grids  and  computed  solutions  ot  tho  plane  ol  symmetry  are  shown  !n 
figures  5.11(b)  lor  Ihe  Initial  grid  and  5.11(c)  lor  the  adapted  grid 
The  effect  ol  tho  adaptation  in  tho  vicinity  oi  the  engine  Inlet  can  be 
observed.  The  grid  and  solution  on  the  surlace  ol  tho  aircraft  is 
shown  in  liguro  5.11  (d)  for  the  initial  grid  and  in  figure  5.11(e)  tor 
the  adapted  grid.  In  this  cose  tire  adaptation  is  very  mild  and  is 
hardly  noticeable,  The  main  roason  tor  this  is  that  tho  resolution  on 
the  initial  grid  is  rather  poor  and  some  important  llow  toaluros  are 
not  properly  captured. 
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6. TRANSIENT  FLOWS  (This  section  has  been  written  in 
collaboration  with  E.  J.  Probert  EOMS,  University  College,  Swansea 
SA2  8PP.  OK  and  0.  Hassan,  COR,  Innovation  Centre,  University 
College,  Swansoa  SA2  8PP.  UK) 

6.1  Transient  Flow* 

Solutions  ol  tho  Euler  equations  are  smooth  over  large  areas  ot  the 
computational  domain  and  exhibit  large  gradients  in  localised  parts  of 
the  tlow.  In  transient  simulations,  thoso  localised  regions  will 
generally  move  through  the  computational  domain  and  may  sweap 
across  very  large  areas  o.g.  lire  case  ot  Hows  involving  propagating 
shocks.  Ttiis  means  that,  unless  adaptivity  is  used,  a  globally  lino 
grid  will  bo  nocossary  to  provide  tho  requlrod  resolution.  Thus  the 
use  ol  adaptivity,  with  tho  possibility  tor  local  grid  refinement  and 
coarsoning.  otters  the  potential  tor  considerablo  computational 
savings.  We  have  already  seen  that  only  a  low  grid  adaptations  aro 
generally  noedod  to  obtain  a  satisfactory  solution  lo  a  stoady 
problom.  but  wo  can  expect  that  grid  adaptation  will  have  to  bo 
periormed  several  thousand  timos  in  a  transient  tlow  simulation. 
Thus  any  potential  computational  savings  which  appear  to  be  ottered 
by  tho  usn  ol  adaptivity  in  this  case  will  only  bo  realised  it  tiro 
adaptation  ot  Ihe  grid  can  be  performed  in  an  efficient  mnnnor. 
Successlul  implementations  of  adaptivity  !o  tho  solution  ot  transient 
problems  have  already  been  made  within  tho  contoxt  ot  both 
structured  (5a)  and  unstructured  grids  (55,56). 

6.2  Crid  Enrichment 

An  obvious  method  ot  achieving  grid  adaptation  for  transient  tlow 
simulation  is  the  extension  ot  the  grid  enrichment  ideas  introduced 
above  lot  the  solution  ot  steady  state  problems.  An  extremely 
successlul  implementation  on  unstructured  triangular  grids  has  been 
made  by  Lfihner  (55).  In  his  method,  the  grid  is  automatically  refined 
and  deretined  as  necessary  according  to  the  results  ol  an  error 
indicating  process.  An  example  (56)  ot  tho  application  ol  this 
procedure  to  shock-bubble  Interaction  problem  i  shown  in  figure 
6.1.  This  oroblem  involves  the  Interaction  between  a  weak  shock, 
travelling  at  a  Mach  number  ot  ’  .29  In  air,  and  a  bubblo  ol  hoavlor 
material  (treon).  From  tho  figure  it  can  be  seen  how  the  shock  speed 
inside  the  bubble  decreases,  owing  to  the  higher  density  of  the  fteon, 
whoroas  tho  outer  shock  bends  over  The  inner  shock  tocusos  at  the 
right  hand  end  ot  the  bubble,  producing  a  significant  over  pressure 
and  initiating  a  small  circular  blast  wave  This  method  has  nlso 
rocomly  been  applied  to  throe  dimoasional  tlow  simulations  (57). 

6.3  Transient  Flows  Involving  Moving  Bodies 

The  comple  ity  involved  in  Handout  llow  simulation  increases  it  ono 
considers  probloms  in  which  certain  boundaries  ol  Ihe  computational 
domain  are  allowed  lo  movo,  so  that  Ihe  geometry  ol  Ihe  domain 
changes  with  lime.  This  means  that  the  grid  must  be  modified  during 
the  computation  in  order  to  accommodate  these  geometrical  changes. 
Ono  approach  which  has  proved  to  be  successlul  tor  tackling  such 
problems  is  the  chimera  approach,  in  which  each  individual  geometry 
component  can  have  its  own  associated  structured  grid  which  con 
move  independently  of  the  ofher  grids.  Three  dimensional  viscous 
simulations  involving  moving  bodies  have  already  bean  produced  by 
this  method  (56).  Unstructured  grids  have  been  applied  to  the 
solution  ot  inviscid  two  dimensional  transient  (lows  involving  moving 
bodies  (59,60),  using  a  method  which  is  an  extension  of  tho 
regriding  procedures  presented  in  section  5.6  and  this  is  the 
approach  that  will  ue  presented  here. 
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Flour*  6.1 

Shack  bubtol*  Interaction  probkun  using  it  art*  km  adaptation  on  enrichment 

(Irom  (56)).  (a)  Initial  grid  and  volution  contour*.  (b)  GrW  and  solution  at  wOO  (c) 
Grid  and  solution  at  U07  (d)  Grid  and  solution  at  uo  e 


Wo  restrict  our  consideration  to  two  dimensionui  mvisuid  (lows  and 
note  that  the  basic  variational  statement  tor  tho  problom  will  need  to 
be  moditiod  to  account  tor  tho  tact  that  ihe  spatial  domain  ll  is 
varying  with  lime.  Suppose  that  wo  have  tho  solution  y„  at  a  certain 
lime  level  t„.  We  attempt  to  satisfy  tho  comprossibio  Eulor  equations 
(1.38)  over  the  space-time  domain  0  -.=  (0(t),  t„  s  t  s  t„.,)  To 
oxpress  this  problem  in  a  variational  form  wo  need  to  introduco 
suitable  trial  and  weighting  function  sets.  We  assume,  lor  the 
purposes  of  this  discussion,  that  tho  conditions  on  the  boundary  r  of 
O  can  bo  expressed  in  the  form 


Although  such  conditions  are  somewhat  unrealistic,  Ihe  actual 
boundary  conditions  which  would  need  10  be  applied  in  the  simulation 
ot  a  givon  problem  can  be  readily  incorporated  by  making 
appropriate  modifications  to  the  following  analysis  Wo  may  dotino 

T  =  ( u  I U  =  a  on  r;  u  »  Ue  on  «  at  t  o  tn) 

(6.2) 

W  .  { W  |  W  -  0  on  I  ) 

and  a  variational  formulation  ot  tho  problem  can  be  stated  as  :  find  U 
in  7  such  that 
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for  ovary  W  in  w.  We  will  assume  that  tha  spatial  domain  Q  has  been 
discretised  using  3  noded  linear  triangular  elements,  with  interior 
nodes  numbered  from  t  to  p  and  introduce  the  sets 
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t(p>  '  (  U(p)  I  U<pi  *  Mi  U1+M2  Uji . +*^1^;  U/j»  «Q  on  r, 

U<p)  -  JJn  on  n  at  1 . 1,,) 

(6.4) 

W(p>  =  ( W(P)  |W(p)  «  «,M,+a2M2  +..*apMp  ;  W(p)  >0  on  r) 

In  the  Approach  which  is  to  bo  followed  hero,  certain  nodes  in  the 
grid  will  be  tixou.  while  others  will  move  with  a  proscribed 
volocity.  The  shape  functions  Mj  are  linear  functions  of  space  and 
time  which  satisfy 


Mj(Jbtn>  =  Nj(i)  MjUUn.U  =.  nV(iU  (6.5) 


Where  Nj  Is  the  standard  finite  otomont  shape  function,  doiinod  in 
soction  1.2,  associated  to  node  j  at  time  l„.  Working  with  the 
function  sets  dellned  In  equation  (6.4),  the  Galerkin  approximation 
statement  takes  the  form  :  find  U,pl  in  7„„  such  that 


aid*. 

dx 


I 
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dll  dt 
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(6.6) 


•or  J  »  1.2 . P-  Considering  tho  first  term  appearing  in  this 

integral,  it  is  possible  to  show  that 

J  Mj  dCl  =  dj  Wj  Uft»  d(i  •  J  *  “•  M-*  Ufa) 

(6.7) 

•J  Tt  ^'da 


6. a  Adaptive  Regrldlng  lor  Trenelenl  Flow*  Involving 
Moving  Bodies 

The  method  described  above,  whereby  a  grid  may  be  adapted  by 
regriding,  is  a  natural  approach  to  follow  tor  the  simulation  of  flows 
involving  moving  bodies.  It  will  be  assumed  tnai  tha  motion  ot  any 
moving  boundary  is  proscribed  and  the  objective  is  to  determine  the 
resulting  llow  Held.  The  description  of  an  algorithm  which  can  be 
devised  to  advance  the  solution  ot  equation  (6.1 1)  In  time  can  bo 
written  as  follows: 

1 .  Generate  an  Initial  grid  to  represent  tho  computational  domain  and 
to  adequately  resolve  the  Initial  solution. 

2.  Start  the  time-step  loop 

2.1  Advance  tha  solution  one  timo-step 

2.2  Update  tho  coordinates  ot  the  points  on  the  moving  boundaries 

2.3  Use  an  error  indicator  to  oxan’ine  (lie  current  solution  and  define 
an  'optimal'  distribution  ol  grid  spacing  and  stretching 

2.4  Compare  tho  current  grid  with  tho  'optimal'  grid.  Doloto  tho 
plomnnts  whose  size  and  shape  is  too  dilforont  tram  tha  optimal 

2.5  Triangulate  the  regions  where  elements  have  been  deleted 
according  to  tho  now  distribution  ot  grid  parameters 

2.6  Oetermino.  by  Interpolation,  the  llow  variables  at  the  new 
nodes 

fcnd  tho  time-stop  loop 

It  is  apparent  tha!  the  crucial  phase  in  this  process  is  the  giid 
adaptation  in  stops  2.3-2.S.  The  mechanics  ol  this  process  is 
lllustratod  diagramnticaiiy  in  figure  6.2.  Tho  success  ol  tho 
procedure  depends  upon  tho  reliability  ol  tho  otror  Indicator  which  Is 
omployed.  The  Indicator  ot  equation  (5.9)  has  again  been  used  lor 
this  application. 


whore  y  denotes  tho  volocity  ol  the  moving  nodes  With  iqp)  -  (v„, 
vy)  interpolated  linoorly  botwoen  tho  nodal  values  ol  y.  an  observor 
moving  with  tho  grid  will  not  detect  any  chango  in  the  shape 
lunenons  1  0 
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Wlioro  D/Dt  donotos  ditloronliation  following  the  moving  grid  and  so 
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Finally.  combine  q  equations  (6  6).  (6  7)  and  (6.9),  tho  Galorkln 
approximation  sulislios 


f  Mj  U(p|  dQ  -  |  Mg  JJjp)  do 
,Jn»i  lVi 


whore 


(6.10) 


*  •  w  v ; 

i  V  v\  Vi 
•,  ■.  ’>  } 


Figur*  6.2 

lllustfiMton  o!  tin*  aonptivo  lamttshtnj)  p(oc^duf*  to  tiauxtonl  pfoblems  (a) 

lnrtl.il  giW  (h)  Marked  unwarned  r»od«»  and  etomeoH  (c)  Elements  are  removed  Irom 
tin*  9*Kt  (it)  Dounv1(uy  skies  -V#  jjenertfed  according  the  Efwcinq  distribution 
to  tOfam  ft  closed  loop  arround  each  bote  (e)  Triaogulahon  ot  tlte  hotel  estng  'I**' 
odviitccn^  hoot  Art <3  ih*  new  distribution  ot  <|Vicirtfl*. 


*•(»>)  ”  -  £{p)  '  vy  U(p)  (61  i  ) 

Inserting  the  assumed  term  tor  from  equation  (0.4), 

*  i 

(M  UJj"'’  •  IM  U]j"  =  -  |  *  °%1-]  dvl  dt  (6.12) 

*n 

The  integral  appearing  bore  can  be  evaluated  by  litst  employing  ono 
point  integration  in  time  (at  t  =  !„,,/;>)  and  then  using  a  two-step 
approximation  |18,I9).  Attificial  viscosity  will  again  be  noodoa  with 
a  scheme  ot  this  type  and  the  resolution  ol  the  resulting  scheme  may 
Ite  improved  by  tho  use  ot  tho  PC  7  idoas  mentioned  in  Section  1.3. 


$.5  Application  Examples 
ID  Shock  Propagation 

The  first  example  considered  consists  ol  a  two  dimensional 
simulation  of  the  transient  development  ol  tho  (low  in  shock  tube. 
Tho  purpose  ot  this  example  is  to  illustrate  the  application  ol  tho 
regridlng  algorithm  to  a  transient  problem  with  lixod  boundaries 
Tho  inilial  conditions  are  such  that  the  solution  consists  oi  a  single 
propagating  shock.  Figure  6.3  depicts  the  development  ot  ttie  grid  arid 
solution  as  the  shock  propagates.  It  can  bo  observed  that  tho  shock 
movemont  Is  adequately  followed  by  tho  adaptation  ol  Ihe  grid.  Noto 
also  that  the  number  of  elements  in  Ihe  grid  remains  approximately 
constant  as  the  solution  progresses. 
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Figure  6.3 

Stack  Inta  oxitmplo  usmg  adoptively  «omed  mestas  (a)  Adopted  gta  and  density 
contours  at  1*0  (!>)  Adapted  prtd  and  density  contours  at  t*=4  (c)  Adapted  nta 
density  contous  nt  t*B 


Inlagrated  Space  Shuttle  Vehicle  Simulation 

A  two  dimensional  computation  has  hoon  nttomptod  which  involves  n 
simulated  space  shuttle  separating  Imm  ti-e  teckel  booster  Th« 
relative  motion  ol  Ihe  shuttle  with  respect  to  tho  booster  has  boon 
proscribed  extotnally.  Tho  Iroo  stream  conditions  corrospond  to  a 
Mach  numbor  of  2  and  an  angle  of  attack  o<  -4°  with  rospecl  to  lire 
mitia  position  ot  tho  shuttlo  An  initial  stoady  state  was  computed 
tor  tho  configuration  shown  in  hguro  6.4(a).  Figures  6.4(b)-6.4(d) 
illustrate  tho  development  ol  tho  grid  and  solution  ns  tho  separation 
proceeds 
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Introduction 

One  of  the  major  achievements  in  engineer¬ 
ing  science  has  been  the  development  of  com¬ 
puter  algorithms  for  solving  nonlinea*-  differential 
equations  such  as  the  Navicr-Stokcs  equations. 
These  algorithms  are  now  used  in  the  practical 
engineering  design  if  devices  such  as  cars  and 
airplanes  as  well  as  theoretical  studies  of  com¬ 
plex  phenomena  such  as  fluid  turbulence.  In  past 
years,  limited  computer  resources  have  motivated 
the  development  of  efficient  numerical  methods 
in  computational  fluid  dynamics  (Ch  D)  utilizing 
structured  meshes.  These  meshes  are  comprised 
of  systematic  arrays  of  quadrilateral  or  licxahe- 
dral  cells.  The  use  of  structured  meshes  greatly 
simplifies  the  implementation  of  CFD  algorithms 
on  conventional  computers.  Structured  meshes 
also  permit  the  use  of  highly  efficient  solution 
techniques  such  as  alternating  direction  implicit 
(ADl)  iteration  schemes  or  multigrid.  Following 
the  dramatic  improvement  in  computing  speed  in 
recent  years,  emphasis  has  shifted  towards  the  de¬ 
sign  of  algorithms  capable  of  treating  complex 
geometries.  The  automatic  generation  of  struc¬ 
tured  grids  about  complex  geometries  is  prob¬ 
lematic.  Unstructured  grids  offer  one  promising 
alternative  teclmiq.  for  treating  these  general 
geometries.  Unstructured  meshes  have  irregular 
connectivity  and  usually  contain  triangles  and/or 
quadrilaterals  in  two  dimensions  and  tetrahedra 
and/or  hexahedra  in  three  dimensions.  The  gen¬ 
eration  and  use  of  unstructured  grids  poses  new 
challenges  in  computational  fluid  dynamics.  This 
is  true  tor  both  grid  generation  as  well  as  for 
the  design  of  algorithms  for  flow  solution.  The 
purpose  of  these  notes  is  to  present  recent  devel¬ 
opments  in  the  unstructured  grid  generation  and 
flow  solution  technology. 

1.0  Preliminaries 

1.1  Graphs  and  Meshes 

Graph  theory  offers  many  valuable  theoret¬ 
ical  results  which  directly  impact  the  'sign  of 
efficient,  algorithms  using  unstructured  grids.  For 
purposes  of  the  present  discussion,  only  simple 
graphs  which  do  not  contain  self  loop's  or  parallel 
edges  will  he  considered.  Results  concerning  sim¬ 
ple  graphs  usually  translate  directly  into  results 
relevant  to  unstructured  grids.  The  most  famous 
graph  theoretic  result  is  Euler’s  formula  which  re¬ 
lates  the  number  of  edges  n(e),  vertices  n(v),  and 
faces  »(/)  of  a  polyhedron  (see  figure  1.0(a)): 

?»(/)  =  «(e)  -  n(v)  +  2  (Euler's  formula) 

(1.0) 


This  polyhedron  can  be  embedded  in  a  plane  by 
mapping  one  face  to  infinity.  This  makes  the 
graph  formula  (1.0)  applicable  to  2-D  unstruc¬ 
tured  meshes.  In  the  example  below,  the  face  1- 
2-3-4  has  mapped  to  infinity  to  form  the  exterior 
(infinite)  face.  If  all  faces  are  numbered  includ¬ 
ing  the  exterior  face,  then  Euler’s  formula  (1.0) 
remains  valid. 


Figure  1.0  (a)  3-D  Polyhedron,  (b)  2-D  Planar 
embedding 

The  infinite  face  can  be  eliminated  by  describing 
the  outer  boundary  in  terms  of  boundary  edges 
which  share  exactly  one  interior  face  (interior  edges 
share  two).  We  also  consider  boundary  edges 
which  form  simple  close  !  curves  in  the  interior 
of  the  mesh.  These  curves  serve  to  describe  pos¬ 
sible  objects  embedded  in  the  mesh  (in  this  case, 
the  polygon  which  they  form  is  not  counted  as  a 
face  of  the  mesh).  The  number  of  these  polygons 
is  denoted  by  n(h).  The  modified  Euler’s  formula 
now  reads 

n(f)  ~r  n(v,  =  u(e)  41-  n(h)  (1.1) 

Since  interior  edges  share  t,wo  faces  and  boundary 
edges  share  one  face,  the  number  of  interior  and 
boundary  edges  can  be  related  to  the  number  of 
faces  by  the  following  formula: 

max  d(f) 

^n(^)  interior  +  rt(e)bound  ~~  ^  ]  *  ,,(/)i  (1-2) 

i=3 

where  n(/),  denotes  the  number  of  faces  of  a  par¬ 
ticular  edge  degree,  d(f)  =  i.  Note  that  for  pure 
triangulations  T,  these  formulas  can  be  used  to 
determine,  independent  of  the  method  of  triangu¬ 
lation,  the  number  of  triangles  or  edges  given  the 
number  of  vertices  n(v),  boundary  edges  n(e)boun<l> 
and  interior  holes  n(h) 

n(f)3  =  2 n(v)  -  n(e)bound  -  2  4-  2 n(h)  (1.3) 

or 


n(e)  =  3n(u)  -  n(e)b„ui,d  -  3  +  3 n(h). 


(1.4) 
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This  is  a  wall  known  result  for  planar  triangu¬ 
lations.  (For  brevity,  we  will  sometimes  use  N 
to  denote  n(v)  in  the  remainder  of  these  notes.) 
In  many  cases  boundary  edges  are  not  explicitly 
given  and  the  boundary  is  taken  to  be  the  con¬ 
vex  hull  of  the  vertices,  i.e.  the  smallest  con¬ 
vex  polygon  surrounding  the  vertices.  (To  obtain 
the  convex  bull  in  two  dimensions,  envision  plac¬ 
ing  an  elastic  rubber  band  around  the  cloud  of 
points.  The  final  shape  of  this  rubber  band  will 
be  the  convex  hull.)  A  key  observation  for  planar 
meshes  is  the  asymptotic  linear  storage  require¬ 
ment  with  respect  to  the  number  of  vertices  for 
u  '•bitrary  mesh  arrangements. 

The  Euler  formula  extends  naturally  to  an 
arbitrary  number  of  space  dimensions.  In  this 
general  setting,  Euler’s  formula  relates  basic  com¬ 
ponents  (vertices,  edges,  faces,  etc.)  of  higher 
dimensional  polytopes.  In  computational  geome¬ 
try  jargon,  vertices,  edges,  and  faces  are  all  spe¬ 
cific  examples  of  “k- faces”.  A  0-face  is  simply  a 
vertex,  a  1-fac  corresponds  to  a  edge,  a  2-face 
corresponds  to  a  facet  (or  simply  a  face),  etc. 
A  polytope  V  in  RJ  contains  k-faces  V  k  € 

(  — 1,0,1,...,  c/} .  The  -l-face  denotes  the  null  set 
face  by  standard  convention.  Let  the  number  of 
k-faces  contained  in  the  polytope  V  be  denoted 
by  Nk(V).  For  example,  Ao('P)  would  denote  the 
number  of  vertices.  Using  this  notation,  we  have 
the  following  relat  ionships: 

No  =  n(v)  (vertices),  Nx  =  n(e)  (edges) 

Ni  =  11(f)  (faces),  N3  =  n(d>)  (volumes) 

By  convention,  there  is  exactly  one  null  set  con¬ 
tained  in  any  polytope,  i.e.  N-i('P)  —  l  and  by 
definition  JVj(P)  =  1.  Using  these  results,  we  can 
succinctly  state  the  general  Euler  formula  for  an 
arbitrary  polytope  in  R'1 

<1 

£  (-1  )kNh(V)  ^  0  (Euler's  Formula  in  Rd) 

t-.-i 

(1.5) 

On  the  surface  of  a  polyhedron  in  3-space,  tl  a 
standard  Euler  formula  ( 1 .0)  is  recovered  since 

-1  -f  A0  -  JV,  +  -  1  =  0 

or 

»»(/)  +  Ti(u)  =  n(e)  +  2. 

To  obtain  results  relevant  to  three-dimensional 
unstructured  grids,  the  Euler  formula  (1.5)  is  ap¬ 
plied  to  a  four-dimensional  polytope. 

(1.6) 


This  formula  relates  the  number  of  vertices,  edges, 
faces,  and  volumes  n(<f>)  of  a  three-dimensional 
mesh.  As  in  the  two-dimensional  case,  this  for¬ 
mula  does  not  account  for  boundary  effects  be¬ 
cause  it  is  derived  by  looking  at  a  single  four- 
dimensional  poly  tope.  The  example  below  demon¬ 
strates  how  t.o  derive  exact  formulas  including 
boundary  terms  for  a  tetrahedral  mesh.  Deriva¬ 
tions  valid  foi  more  general  rneshes  in  three  di¬ 
mensions  are  also  possible. 

Example:  Derivation  of  Exact  Euler  Formula  for 
3-D  Tetrahedral  Mesh. 

Consider  the  collection  of  volumes  incident 
to  a  single  vertex  v,  and  the  polyhedron  which 
describes  the  shell  formed  by  these  vertices.  Let 
Fb(vi)  and  Nb(vi)  denote  the  number  of  feces  and 
vertices  respectively  of  this  polyhedron  which  ac¬ 
tually  lie  on  the  boundary  of  the  entire  mesh. 
Also  let  E(vi)  denote  the  total  number  of  edges 
on  the  polyhedron  surrounding  Finally,  let 
d^(vi)  and  rfe(t>; )  denote  the  number  of  tetrahe¬ 
dral  volumes  and  edges  respectively  that  are  in¬ 
cident  to  v;.  On  this  polyhedron,  we  have  exact 
satisfaction  of  Euler’s  formula  (1.0),  i.e. 

polyhedral  face*:  ti«r!icea  on  polyhedron 

<*«(»/<)  +  Fb(Vi)  -f  de(Vi)  +  Nb{i>i)  =  E{vi )  +  2 

(1.7) 

Note  that  this  step  assumes  that  the  polyhedron 
is  lioineomorphic  to  a  sphere  (otherwise  Euler’s 
formula  fails).  In  reality,  this  is  not  a  severe  as¬ 
sumption.  (It  would  preclude  a  mesh  consisting 
of  two  tetrahedra  which  touch  at  a  single  vertex.) 
On  the  polyhedron  wc  also  have  that 

polyho<)r\l  fac<?y 

2 E(vi)  =  3 \d^Vi)l  lb(v^ .  (1.8) 

Combining  (1.7)  and  (1.8)  yields 

djvi)  =  ^ Fb(vi )  -  Nb{ui)  +  2.  (1.9) 

Summing  this  equation  over  all  vertices  produces 

den(v)  =  ^  ^rf*n(«)  +  ~  YlNh(v'> 

(110) 

where  <le  and  are  the  average  vertex  degrees 
with  respect  to  edges  and  volumes.  Since  globally 
we  have  that 

drn(v)  =  2?*(e),  d#n{v)  =  4  n(</>),  (1.11) 


«(/)  +  n(v)  =  n(c)  +  n{<}>) 
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substitution  of  (1.11)  into  (1.10)  reveals  that 

»(e)  =  n(<p)  +  n(v)  +  ~  Y1  Ft>(Vi )  “  \  Y1 Nb^  ' 

(1.12) 

Finally,  note  that  £^fc(w«)  =  3 n(f)bound.  Insert¬ 
ing  this  relationship  into  (1.12)  yields 

3  1 

n(e)  =  n(4>)  +  n(v)  +  ^n(/)(.o«.u/  -  -n(v)bounA 

(1.13) 

Other  equivalent  formulas  are  easily  obtained  by 
combining  this  equation  with  the  formula  relating 
volume,  faces,  and  boundary  faces,  i.e. 

»(/)  =  2  n{<f>)  +  ~n(f)t„»nd  (1.14) 

An  exact  formula,  similar  to  (1.6),  is  obtained  by 
combining  (1.14)  and  (1.13) 

n(e)+7j(</>)=n(/)+n(w)+^n(/)<,0„nrf-^n(w)6oun</ 

(1.15) 


1.2  Duality 

Given  a  planar  graph  G,  we  informally  de¬ 
fine  a  dual  graph  Gouai  to  be  any  graph  with  the 
following  three  properties:  each  vertex  of  G putli 
is  associated  with  a  face  of  G;  each  edge  of  G 
is  associated  with  an  edge  of  G  ouai\  if  an  edge 
separates  two  faces,  /,•  and  fj  of  G  then  the  asso¬ 
ciated  dual  edge  connects  two  vertices  of  Gouai 
associated  with  /,  and  f} .  This  duality  plays  an 
important  role  in  CFD  algorithms. 


Figure  1.1  Several  triangulation  duals. 

In  figure  1.1,  edges  and  faces  about  the  cen¬ 
tral  vertex  are  shown  for  duals  formed  from  me¬ 
dian  segments,  centroid  segments,  and  by  Dirich- 
let  tessellation.  (The  Diriclilet  tessellation  of  a 
set  of  points  is  a  pattern  of  convex  regions  in  the 
plane,  each  region  being  the  portion  of  the  plane 
closer  to  some  given  point  P  of  the  set  of  points 


than  to  any  other  point.)  Two-dimensional  finite- 
volume  schemes  for  the  Euler  and  Navier-Stokes 
equations  are  frequently  developed  which  form 
control  volumes  from  either  faces  (cells)  of  the 
mesh  or  faces  of  the  mesh  dual.  Schemes  which 
use  the  cells  of  the  mesh  as  control  volumes  are 
often  called  “cell-centered”  schemes.  Other  “ver¬ 
tex”  schemes  use  mesh  duals  constructed  from 
median  segments,  Dirichlet  regions,  or  centroid 
segments.  In  all  of  these  schemes,  the  primary 
computational  effort  is  associated  with  the  cal¬ 
culation  of  the  flux  of  mass,  momenta,  and  en¬ 
ergy  through  an  edge  associated  with  the  con¬ 
trol  volume.  The  one-to-one  correspondence  of 
edges  of  a  mesh  and  mesh  dual  (ignoring  bound¬ 
aries)  means  that  there  is  very  little  difference  in 
computational  effort  in  schemes  based  on  mesh 
faces  or  duals.  This  observation  is  not  true  in 
three  dimensions!  Consider  a  three  dimensional 
tetrahedral  mesh.  The  duality  for  this  nonpla- 
nar  arrangement,  is  between  edges  of  the  tetrahe¬ 
dral  mesh  and  faces  of  the  dual.  In  other  words, 
for  each  edge  of  the  mesh  there  is  a  one-to-one 
correspondence  with  a  face  of  the  dual  (ignor¬ 
ing  boundaries)  Again,  the  main  computational 
effort  associated  with  finite-volume  schemes  for 
solving  the  Euler  and  Navier-Stokes  equations  is 
the  calculation  of  the  flux  through  each  face  of 
the  control  volume.  If  the  control  volumes  are 
the  tetrahedra  themselves  (cell-centered  scheme), 
then  a  flux  must  be  calculated  for  each  tetrahe¬ 
dral  face.  This  means  that  the  work  is  propor¬ 
tional  to  the  number  of  faces  of  the  tetrahedral 
mesh.  FYom  equ.  (1.14),  the  number  of  faces  of 
a  tetrahedral  mesh  is  related  to  the  number  of 
tetrahedra  and  boundary  fcices  by 


lCOrfcc_c  scheme  n(/)  —  2n(flf>)  -f-  t^1l(f)boun<l- 

If  the  control  volumes  of  the  finite- volume  scheme 
are  formed  from  a  mesli  dual  (vertex  scheme), 
then  the  number  of  flux  calculations  is  propor¬ 
tional  to  the  number  of  faces  of  the  mesh  dual 
which  is  roughly  equal  to  the  number  of  edges  of 
the  original  tetrahedral  mesh.  From  eqn.  (1.13) 
we  have  that 

workvcrt  schr.,nc  *  n(e)  =  n((f>)  +  n(v) 

3  1 

4" ^ll{f)bound  ~  2^(^)&oum«/ 

To  better  understand  the  work  estimates,  define 
ft  such  that  n{<f>)  =  ftn(v).  Practically  speaking, 
ft  usually  ranges  from  5-7  for  tetrahedral  meshes. 
Taking  the  ratio  of  the  work  estimates  for  the 
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cell-centered  and  vertex  scheme,  ignoring  bound¬ 
ary  terms  and  assuming  an  identical  constant  of 
proportionality,  we  obtain 

workc — £  scheme  _  2/3n(u)  _  2/?  If") 

wovkrcrt  scheme  (1  "f“  /3")tl(v)  1  “h  P 

The  work  for  the  cell-centered  scheme  approaches 
twice  that  of  the  vertex  scheme.  The  reader  should 
not  automatically  infer  that  the  vertex  scheme  is 
preferred.  The  question  of  solution  accuracy  of 
the  two  approaches  needs  to  be  factored  into  the 
equation.  The  answer  to  the  question  of  which  is 
“better”  is  still  a  subject  for  debate. 

1.3  Data  Structures 

The  choice  of  data  structures  used  in  rep¬ 
resenting  unstructured  grids  varies  considerably 
depending  on  the  type  of  algorithmic  operations 
to  be  performed.  In  this  section,  a  few  of  the 
most  common  data  structures  will  be  discussed. 
The  mesh  is  assumed  to  have  a  numbering  of  ver¬ 
tices,  edges,  faces,  etc.  In  most  cases,  the  physical 
coordinates  are  simply  listed  by  vertex  number. 
The  “standard”  finite  element  (FE)  data  struc¬ 
ture  lists  connectivity  of  each  element.  For  ex¬ 
ample  in  figure  1.2(a),  a  list  of  the  three  vertices 
of  each  triangle  would  be  given. 


(•)  (b) 


Figure  1.2  Data  structures  for  planar  graphs, 
(a)  FE  data  structure,  (b)  Edge  structure,  (c) 
Out-degree  structure,  (d)  Quad-edge  structure. 

The  FE  structure  extends  naturally  to  three  di¬ 
mensions.  The  FE  structure  is  used  extensive  in 
finite  element,  solvers  for  solids  and  fluids. 

For  planar  meshes,  another  typical  structure 
is  the  edge  structure  (figure  1.2(b))  which  lists 


connectivity  of  vertices  and  adjacent  faces  by  stor¬ 
ing  a  quadruple  tor  each  edge  consisting  of  the 
origin  and  destination  of  the  each  edge  as  well  as 
the  two  faces  (cells)  that  share  that  edge.  This 
structure  allows  easy  traversal  through  a  mesh 
which  can  be  useful  in  certain  grid  generation  al¬ 
gorithms.  (This  traversal  is  not  easily  done  using 
the  FE  structure.)  The  extension  to  three  di¬ 
mensions  is  facewisc  (vertices  of  a  face  are  given 
as  well  as  the  two  neighboring  volumes)  and  re¬ 
quires  distinction  between  different  face  types. 

A  third  data  structure  provides  connectivity 
via  vertex  lists  as  shown  in  figure  1.2(c).  The 
brute  force  approach  is  to  list  all  adjacent  neigh¬ 
bors  for  each  vertex  (usually  as  a  linked-list).  Many 
sparse  matrix  solver  packages  specify  nonzeros  of 
a  matrix  using  row  or  column  storage  schemes 
which  list  all  nonzero  entries  of  a  given  row  or 
column.  For  discretizations  involving  only  adja¬ 
cent  neighbors  of  a  mesh,  this  would  be  identical 
to  specifying  a  vortex  list.  An  alternative  to  spec¬ 
ifying  all  adjacent  neighbors  is  to  direct  edges  of 
the  mesh.  In  this  case  only  those  edges  which 
point  outward  from  a  vertex  are  listed.  In  the 
next  section,  it  will  be  shown  that  an  out-degree 
list  can  be  constructed  for  planar  meshes  by  di¬ 
recting  a  graph  such  that  no  vertex  has  more  than 
three  outgoing  edges.  This  is  asymptotically  op¬ 
timal.  The  extension  of  the  out-degree  structure 
to  three  dimensions  is  not  straightforward  and  al¬ 
gorithms  for  obtaining  optimal  edge  direction  for 
nonplanar  graphs  are  still  under  development. 

The  last  structure  considered  here  is  the  quad- 
edge  structure  proposed  by  Guibas  and  Stolfi  [1], 
see  figure  1  2(d).  Each  edge  is  stored  as  pair  of 
directed  edges.  Each  of  the  directed  edges  stores 
its  origin  and  pointers  to  the  next  and  previous 
directed  edge  of  the  region  to  its  left.  The  quad- 
edge  structure  is  extremely  useful  in  grid  gener¬ 
ation  where  distinctions  between  topological  and 
geometrical  aspects  are  sometimes  crucial.  The 
structure  has  been  extended  to  three  dimensional 
arrangements  by  Dobkiu  and  Laslo  [2]  and  Bris- 
son  [3]. 

2.0  Some  Basic  Graph  Operations 
Important  in  CFD 

Implementation  of  unstructured  grid  meth¬ 
ods  on  differing  computer  architectures  las  stim¬ 
ulated  research  in  exploiting  properties  and  char¬ 
acterizations  of  planar  and  nonplanar  graphs.  For 
example  in  Hammond  and  Barth  [4],  we  exploited 
a  recent  theorem  by  Chrobak  and  Eppstein  (5) 
concerning  directed  graphs  with  minimum  out- 
degree.  In  this  section,  we  review  this  result  as 


well  as  presenting  other  basic  graph  operations 
that  are  particularly  useful  in  CFD.  Some  of  these 
algorithms  are  specialized  to  planar  graphs  (2-D 
meshes)  while  others  are  very  general  and  apply 
in  any  number  of  space  dimensions. 

2.1  Planar  Graphs  with  Minimum  Out-Degree 
Theorem:  Every  planar  graph  has  a  3- bounded 
orientation,  [5j. 

In  other  words,  each  edge  of  a  planar  graph 
can  be  assigned  a  direction  such  that  the  maxi¬ 
mum  number  of  edges  pointing  outward  from  any 
vertex  is  less  than  or  equal  to  three,  A  construc¬ 
tive  proof  is  given  in  ref.  [5]  consisting  of  the  fol¬ 
lowing  steps.  The  first  step  is  to  find  a  reduceable 
boundary  vertex.  A  rcduceablc  boundary  vertex 
is  any  vertex  on  the  boundary  with  incident  ex¬ 
terior  (boundary)  edges  that  connect  to  at  most 
two  other  boundary  vertices  and  any  number  of 
interior  edges.  Chrobak  and  Eppstein  prove  that 
reduceable  vertices  can  always  be  found  for  ar¬ 
bitrary  planar  graphs.  (In  fact,  two  reduceable 
vertices  can  always  be  found!)  Once  a  rcduce- 
able  vertex  is  found  then  the  two  edges  connecting 
to  the  other  boundary  vertices  are  directed  out¬ 
ward  and  the  remaining  edges  are  always  directed 
inward.  These  directed  edges  are  then  removed 
from  the  graph,  see  Figures  2.0(a-j),  The  process 
is  then  repeated  on  the  remaining  graph  until  no 
more  edges  remain.  The  algorithm  shown  picto- 
rially  in  figure  2.0  is  summarized  in  the  following 
steps: 

Algorithm:  Orient  a  Graph  with  maximum  out- 
degree  <  3. 

Step  1.  Find  reduceable  boundary  vertex. 

Step  2.  Direct  exterior  edges  outward  and  interior 
edges  inward. 

Step  3.  Remove  directed  edges  from  graph. 

Step  4-  If  undirected  edges  remain,  go  to  step  1 


Figure  2.0  (a-i)  Mesh  orientation  procedure  with 
out-degree  3,  (j)  final  oriented  triangulation. 

Linear  time  algorithms  are  given  in  [5],  In 
the  paper  by  Hammond  and  Barth,  we  exploit 
the  out-degree  property  to  provide  optimal  load 
balancing  on  a  massively  parallel  computer.  De¬ 
tails  are  given  in  a  later  section. 


2.2  Graph  Ordering  Techniques 

The  particular  ordering  of  solution  unknowns 
can  have  a  marked  influence  on  the  amount  of 
computational  effort  and  memory  storage  required 
to  solve  large  sparse  linear  systems  and  eigen¬ 
value  problems.  In  many  algorithms,  the  band 
width  and/or  profile  of  the  matrix  determines  the 
amount  of  computation  and  memory  required. 
Most  meshes  obtained  from  grid  generation  codes 
have  very  poor  natural  orderings.  Figures  2.1  and 

2.2  show  a  typical  mesh  generated  about  a  multi- 
component  airfoil  and  the  nonzero  entries  asso¬ 
ciated  with  the  “Laplacian”  of  the  graph.  The 
Laplacian  of  a  graph  would  represent  the  nonzero 
entries  due  to  a  discretization  which  involves  only 
adjacent  neighbors  of  the  mesh.  Figure  2.2  in¬ 
dicates  that  the  band  width  of  the  natural  or¬ 
dering  is  almost  equal  to  the  dimension  of  the 
matrix!  In  parallel  computation,  the  ordering  al¬ 
gorithms  can  be  used  as  means  for  partitioning 


a  mesh  among  processors  of  the  computer.  This 
will  be  addressed  in  the  next  section. 


Figure  2.1  Typical  Steiner  triangulation  about 
multi-component  airfoil. 


Figure  2.2  Nonzero  entries  of  Laplacian  matrix 
produced  from  natural  ordering. 

Several  algorithms  exist  which  construct  new 
orderings  by  attempting  to  minimize  the  band 
width  of  a  matrix  or  attempting  to  minimize  the 
fill  that  occurs  in  the  process  matrix  factoriza¬ 
tion.  These  algorithms  usually  rely  on  heuristics 
to  obtain  high  efficiency,  and  do  not  usually  ob¬ 
tain  an  optimum  ordering.  One  example  would 
be  Rosen’s  algorithm  [6]  which  iterates  on  the  or¬ 
dering  to  minimize  the  maximum  band  width. 


Algorithm;  Graph  ordering,  Rosen. 

Step  t.  Determine  band  width  and  the  defining 
index  pair  (* ,  j)  with  (*'  <  i) 

Step  2.  Does  their  exist  an  exchange  which  in¬ 
creases  t  or  decreases  j  so  that  the  band  width  is 
reduced?  If  so,  exchange  and  go  to  step  1 
Step  3.  Does  their  exist,  an  exchange  which  in¬ 
creases  i  or  decreases  j  so  that  the  band  width 
remains  the  same?  If  so,  exchange  and  go  to  step 
1 

This  algorithm  produces  very  good  orderings 
but  can  be  very  expensive  for  large  matrices.  A 
popular  method  which  is  much  less  expensive  for 
large  matrices  is  the  Cnthill-McKoc  [7]  algorithm. 

Algorithm:  Graph  ordering,  Cuthill-McKec. 

Step  1.  Find  vertex  with  lowest  degree.  This  is 
the  root  vertex. 

Step  2.  Find  all  neighboring  vertices  connecting 
to  the  root  by  incident  edges.  Order  them  by 
increasing  vertex  degree.  This  forms  level  1. 

Step  3.  Form  level  k  by  finding  all  neighboring; 
vertices  of  level  k  -  1  which  have  not  been  pre¬ 
viously  ordered.  Order  these  new  vertices  by  in¬ 
creasing  veil  ex  degree. 

Step  4-  W  vertices  remain,  go  to  step  3 


Figure  2.3  Nonzero  entries  of  Laplacian  matrix 
after  Cuthill-McKee  ordering. 

The  heuristics  behind  the  Cuthill-McKec  al¬ 
gorithm  are  very  simple.  In  the  graph  of  the 
mesh,  neighboring  vertices  must  have  numberings 
which  are  near  by,  otherwise  they  will  produce  en¬ 
tries  in  the  matrix  with  large  band  width.  The 
idea  of  sorting  elements  among  a  given  level  is 
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based  on  the  heuristic  that  vertices  with  high  de¬ 
gree  should  be  given  indices  as  large  as  possible 
so  that  they  will  be  as  close  as  possible  to  vertices 
of  the  next  level  generated,  Figure  2.3  shows  the 
dramatic  improvement  of  the  Cuthill-McKce  or¬ 
dering  on  the  matrix  shown  in  figure  2.2. 

Studies  of  the  Cuthill-McKee  algorithm  have 
shown  that  the  profile  of  a  matrix  can  be  greatly 
reduced  simply  by  reversing  the  ordering  of  the 
Cuthill-McKee  algorithm,  see  George  [8].  This 
amounts  to  a  renumbering  given  by 

k  — ♦  n  —  k  +  1  (2.1) 

where  n  is  the  size  of  the  matrix.  While  this 
does  not.  change  the  bandwidth  of  tiie  matrix, 
it  can  dramatically  reduce  the  fill  that  occurs  in 
Cholcsky  or  L-U  matrix  factorization  when  com¬ 
peared  to  the  original  Cuthill-McKee  algorithm. 


2.3  Graph  Partitioning  for  Parallel  Computing 

An  efficient,  partitioning  of  a  mesh  for  dis¬ 
tributed  memory  machines  is  one  that  ensures 
an  tovtii  distribution  of  computational  workload 
among  the  processors  and  minimizes  the  amount 
of  time  spent  in  interproeessor  communications. 
The  former  requirement  is  termed  load  balanc¬ 
ing.  For  if  the  load  were  not  evenly  distributed, 
some  processors  will  have  to  sit  idle  at  synchro¬ 
nization  points  waiting  for  other  processors  to 
catch  up  The  second  requirement  conies  from 
the  fact  that  communication  between  processors 
takes  time  and  it.  is  not  always  possible  to  hide 
this  latency  in  data  transfer.  In  our  parallel  im¬ 
plementation  of  a  finite- volume  flow  solver  ou  un¬ 
structured  grids,  data  for  the  nodes  that  lie  on  the 
boundary  between  two  processors  is  exchanged, 
hence  requiring  a  bidirectional  data- transfer.  On 
many  systems,  a  synchronous  exchange  of  data 
can  yield  a  higher  performance  than  when  done 
asynchronously.  To  exploit  this  fact,  edges  of 
the  communication  graph  are  colored  such  that 
no  vertex  has  more  than  one  edge  of  a  certain 
color  incident  upon  it.  A  communication  graph 
is  a  graph  in  which  the  vertices  are  the  proces¬ 
sors  and  an  edge  connects  two  vertices  if  the  two 
corresponding  processors  share  an  interproeessor 
boundary.  The  colors  in  the  graph  represent  sep¬ 
arate  communication  cycles.  For  instance,  the 
mesh  partitioned  amongst  four  processors  as  shown 
in  figure  2.4(a),  would  produce  the  communica¬ 
tion  graph  shown  in  figure  2.4(h). 


Figure  2.4  (a)  Four  partition  mesh,  (b)  Commu¬ 
nication  graph. 

The  graph  shown  in  figure  2.4(b)  can  be  colored 
edgewise  using  three  colors.  For  example,  in  the 
first  communication  cycle,  processors  (1, 4)  could 
perform  a  synchronous  data  exchange  as  would 
processors  (2,3).  In  the  second  communication 
cycle,  processors  (1,2)  and  (3,4)  would  exchange 
and  in  the  third  cycle,  processors  (1,3)  would  ex¬ 
change  while  processors  2  and  4  sit.  idle.  Viiing’s 
theorem  proves  that  any  graph  of  maximum  ver¬ 
tex  degree  A  (number  of  edges  incident  upon  a 
vertex)  can  be  colored  using  n  colors  such  that 
A  <  «  <  A  4-  1.  Hcncc,  any  operation  that 
calls  for  every  processor  to  exchange  data  with  its 
neighbors  will  require  n  communication  cycles. 

The  actual  cost  of  communication  can  often 
be  accurately  modeled  by  the  linear  relationship: 

Cost  =  « -f-  [dm  (2.2) 

where  a  is  the  time  required  to  initiate  a  mes¬ 
sage,  /?  is  the  rate  of  data-transfer  between  two 
processors  and  m  is  tiie  message  length.  For  n 
messages,  the  cost  would  be 

Cost  =  5^(a  +  0mn).  (2.3) 

If 

This  cost  can  be  reduced  in  two  ways.  One  way  is 
to  reduce  A  thereby  reducing  n.  The  alternative 
is  to  reduce  the  individual  message  lengths.  The 
bounds  on  n  are  2  <  N  <  P—  1  for  P  >  3  where  P 
is  the  total  number  of  processors.  Figure  2.5(a) 
shows  the  partitioning  of  a  mesh  which  reduces 
A,  and  2.5(b)  shows  a  partitioning  which  mini¬ 
mizes  the  message  lengths.  For  the  mesh  in  figure 
2.5(a),  A  =  2  while  iti  figure  2.5(b),  A  =  3.  How¬ 
ever,  the  average  message  length  for  the  parti¬ 
tions  shown  in  figure  2.5(b)  is  about  half  as  much 
as  that  in  figure  2.5(a). 
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Figure  2.5  (a)  Mesh  partitioning  with  minimized 
A,  (b)  Mesh  with  minimizes  message  length. 

In  practice,  it.  is  difficult  to  partition  an  unsti* 
turocl  mesh  while  simultaneously  minimizing  the 
number  and  length  of  messages.  In  the  following 
paragraphs,  a  few  of  the  most  popular  partition¬ 
ing  algorithms  which  approximately  accomplish 
this  task  will  be  discussed.  All  the  algorithms 
discussed  below:  coordinate  bisection,  Cuthill- 
McKee,  and  spectral  partitioning  are  evaluated  in 
t  he  paper  by  Venkatakrishnan,  Simon,  and  Barth 

[9] .  This  paper  evaluates  the  partitioning  tech¬ 
niques  within  the  confines  of  an  explicit,  unstruc¬ 
tured  finite-volume  Euler  solver.  Spectral  par¬ 
titioning  has  been  extensively  studied  by  Simon 

[10] .  Tlu  algorithms  have  also  been  implemented 
in  three  dimensions  by  A.  Gandhi  working  in  the 
CFD  branch  at  NASA  Ames  Research  Center. 

Note  that,  for  the  particular  applications  that 
we  have  in  mind  (a.  finite-volume  scheme  with  so¬ 
lution  unknowns  at  vertices  of  the  mesh),  it  makes 
sense  to  partition  the  domain  such  that  the  sep¬ 
arators  correspond  to  edges  of  the  mesh.  Also 
note  that  the  p<Lii.iiaoiuiig  algorithms  all  can  be 
implemented  recursively.  The  mesh  is  first  di¬ 
vided  into  two  sub-meshes  of  nearly  equal  size. 
Each  of  those  sub-meshes  is  subdivided  into  two 
more  sub-meshes  and  the  process  in  repeated  un¬ 
til  the  desired  number  of  partitions  P  is  obtained 
(P  is  a  integer  power  of  2).  Since  we  desire  the 
separator  of  the  partitions  to  coincide  with  edges 
of  tho  mesh,  the  division  of  a  sub-mesh  into  two 
pieces  can  be  viewed  as  a  2-coloring  of  faces  of  the 
sub-mesh.  For  the  Cuthill-McKee  and  spectral 
partitioning  techniques,  this  amounts  to  supply¬ 
ing  these  algorithms  with  the  dual  of  the  graph 
for  purposes  of  the  2-coloring.  The  balancing  of 
each  partition  is  usually  done  cellwisc;  although 
an  edgewise  balancing  is  more  appropriate  in  the 
present  applications.  Due  to  the  recursive  na¬ 
ture  of  partitioning,  the  algorithms  outlined  be¬ 
low  represent  only  a  single  step  of  the  process. 


Cooi'dinatc  Bisection 

In  the  coordinate  bisection  algorithm,  face 
centroids  are  sorted  either  horizontally  or  verti¬ 
cally  depending  of  the  current  level  of  the  recur¬ 
sion.  A  separator  is  chosen  which  balances  the 
number  of  faces.  Faces  are  colored  depending  on 
which  side  of  the  separator  they  are  located.  The 
actual  edges  of  the  mesh  corresponding  to  the 
separator  are  characterized  as  those  edges  which 
have  adjacent  faces  of  different  color,  see  figure 
2.6.  This  partitioning  is  very  efficient  to  cre¬ 
ate  but  gives  sub-optimal  performance  on  parallel 
computations  owing  to  the  long  message  lengths 
than  can  routinely  occur. 


Figure  2.6  Coordinate  bisection  (16  partitions). 
Cuthill-McKee 

The  Cuthill-McKee  (CM)  algorithm  described 
earlier  can  also  be  used  for  recursive  mesh  parti¬ 
tioning.  In  this  case,  the  Cuthill-McKee  order¬ 
ing  is  performed  on  the  dual  of  the  mesh  graph. 
A  separator  is  chosen  either  at  the  median  of 
the  ordering  (which  would  balance  the  coloring 
of  faces  of  the  original  mesh)  or  the  separator  is 
chosen  at  the  level  set  boundary  closest  to  the 
median  as  possible.  This  latter  technique  has  the 
desired  effect  of  reducing  the  number  of  discon¬ 
nected  sub-graphs  that  occur  during  the  course 
of  the  partitioning.  Figure  2.7  shows  a  Cuthill- 
McKee  partitioning  for  the  multi-component  air¬ 
foil  mesh.  The  Cuthill-McKee  ordering  tends  to 
produce  long  boundaries  because  of  the  way  that 
the  ordering  is  propagated  through  a  mesh.  The 
maximum  degree  of  the  communication  graph  also 
tends  to  be  higher  using  the  Cuthill-McKee  algo¬ 
rithm.  The  results  shown  in  ref.  [9]  for  multi¬ 
component  airfoil  grids  indicate  a  performance  on 


parallel  computations  which  is  slightly  worse  than 
the  coordinate  bisection  technique. 


Figure  2.7  Cuthill-McKee  partitioning  of  mesh. 
Speet.ml  Partitioning 

The  last  partitioning  considered  is  the  spec¬ 
tral  partitioning  which  exploits  properties  of  the 
Laplaciau  C  of  a  graph  (defined  below).  The  al¬ 
gorithm  consists  of  the  following  steps: 

Algorithm:  Spectral  Partitioning. 

Stop  1.  Calculate  the  matrix  C  associated  with 
the  Laplacian  of  the  graph  (dual  graph  in  the 
present  case). 

Step  2.  Calculate  the  eigenvalues  and  eigenvec¬ 
tors  of  C. 

Step  3.  Order  the  eigenvalues  by  magnitude,  Aj  < 

A«  < 

Step  4-  Determine  the  smallest  nonzero  eigen¬ 
value,  A f  and  its  associated  eigenvector  x/  (the 
Fiedler  vector). 

Step  5.  Sort  elements  of  the  Fiedler  vector. 

Step  6.  Choose  a  divisor  at  the  median  of  the 
sorted  list  and  2-color  vertices  of  the  graph  (or 
dual)  which  correspond  to  elements  of  the  Fielder 
vector  less  than  or  greater  than  the  median  value. 

The  spectral  partitioning  of  the  multi-component 
airfoil  is  shown  in  figure  2.8.  In  reference  (9), 
we  found  that  parallel  computations  performed 
slightly  better  on  the  spectral  partitioning  than 
on  the  coordinate  bisection  or  Cuthill-McKcc,  The 
cost  of  the  spectral  partitioning  is  very  high  (even 
using  a  Lanczos  algorithm  to  compute  the  eigen¬ 
value  problem).  It  has  yet  to  be  determined  if  the 
spectral  partitioning  will  have  practical  merit. 


Figure  2.8  Spectral  partitioning  of  multi-  com¬ 
ponent  airfoil. 

The  spectral  partitioning  exploits  a  peculiar  prop¬ 
erty  of  the  “second”  eigenvalue  of  the  Laplacian 
matrix  associated  with  a  graph.  The  Laplaciau 
matrix  of  a  graph  is  simply 


C  =  -D  +  A.  (2.-1) 

where  A  is  the  standard  adjacency  matrix 


Aij 


c(u,,u>)6  G 
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and  D  is  a  diagonal  matrix  with  entries  equal  to 
the  degree  of  each  vertex.  D,  —  d(v,).  From  this 
definition,  it  should  be  clear  that  rows  of  C  each 
sum  to  zero.  Define  an  A-vector,  s  =  [1, )J, 
By  construction  we  h-ve  that 


Cs  =  0.  (2.6) 

This  means  that  at  least  one  eigenvalue  is  zero 
with  s  as  an  eigenvector. 

The  objective  of  the  spectral  partitioning  is  to  di¬ 
vide  the  mesh  into  two  partitions  of  equal  size 
such  that  the  number  of  edges  cut  by  the  parti¬ 
tion  boundary  is  approximately  minimized. 

Technically  speaking,  the  smallest  nonzero 
eigenvalue  need  not  be  the  second.  Graphs  witli 
disconnected  regions  will  have  more  that  one  zero 
eigenvalue  depending  on  the  number  of  discon¬ 
nected  regions.  For  purposes  of  discussion,  v.c 
assume  that  disconnected  regions  are  not  present, 
i.e.  that  A2  is  the  relevant  cigcnmodc. 
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Elements  of  the  proof: 

Define  a  partitioning  vector  which  2-colors 
the  vertices 

|>  =  [+1,  -1,-1,+1,-U . +1.-1]7'  (2.7) 

depending  on  the  sign  of  elements  of  p  and  the 
one-to-one  correspondence  with  vertices  of  the 
graph,  see  for  example  figure  2.9.  Balancing  the 
number  of  vertices  of  each  color  amounts  to  the 
requirement-  that 

s  1  p  (2.8) 

where  wc  have  assumed  an  even  number  of  ver¬ 
tices. 


Figure  2.9  Arbitrary  graph  with  2- coloring  show¬ 
ing  separator  and  cut  edges. 

The  key  observation  is  that  the  number  of  cut 
edges,  is  precisely  related  to  the  L\  norm  of 
the  Laplacian  matrix  multiplying  the  partitioning 
vector,  i.e. 

4E,  =  ||£p||,  (2.9) 

which  can  l»e  easily  verified.  The  goal  is  to  mini¬ 
mize  cut  edges.  That  is  to  find  p  which  minimizes 
||£p||i  subject  to  the  constraints  that  ||p||i  =  IS' 
and  s  .1.  p.  Since  C  is  a  real  symmetric  (posi¬ 
tive  semi-definite)  matrix,  it  has  a  complete  set 
of  real  eigenvectors  which  can  be  orthogonali/.ed 
with  each  other.  The  next  step  of  the  proof  would 
be  to  extend  the  domain  of  p  to  include  real  num¬ 
bers  (this  introduces  an  inequality)  and  expand  p 
in  terms  of  the  orthogonal  eigenvectors. 

n 

P«5>x,-  (2.10) 

i=:l 


By  virtue  of  (2.6)  wc  have  that  xi  =  s.  It  remains 
to  be  shown  that  ||£p||i  minimized  when  p  s 
p'  —  UX2/HX2II1  ,i.c.  when  the  Fiedler  vector  is 
used.  Inserting  this  expression  for  p  we  have  that 

IU-p'IIi -”V:  (2.11) 

It  is  a  simple  matter  to  show  that  adding  any 
other  eigenvector  component  to  p'  while  insisting 
that  Up’ll  1  =  N  can  only  increase  the  norm. 
This  would  complete  the  proof.  Figure  2.10  plots 
contours  (level  sets)  of  the  Fiedler  vector  for  the 
multi-component  airfoil  problem. 


Figure  2.10.  Contours  of  Fiedler  Vector  for 
Spectral  Partitioning.  Dashed  lines  are  less  than 
the  median  value. 

3.0  Triangulation  Methods 

Although  many  algorithms  exist  for  triangu¬ 
lating  sites  (points)  in  an  arbitrary  number  of 
space  dimensions,  only  a  few  have  been  used  on 
practical  problems.  I11  particular,  Delaunay  tri- 
angulation  has  proven  to  be  a  very  useful  triangu 
iation  technique.  This  section  will  present,  some 
of  the  basic  concepts  surrounding  Delaunay  and 
related  triangulations  as  well  as  discussing  some 
of  the  most  popular  algorithms  for  constructing 
these  triangulations.  The  discussion  of  the  ad¬ 
vancing  front,  method  of  grid  generation  will  be 
deferred  to  Professors  Morgan  and  Lolmcr. 

3.1  Voronoi  Diagram  and  Delaunay  'IYinngnlation 

Recall  the  definition  of  the  Dirichlot.  tessel¬ 
lation  in  a  plane.  The  Dirichlct  tessellation  of 
a  point  set  is  the  pattern  of  convex  regions,  each 
being  closer  to  some  point  P  in  the  point  set  than 
to  any  other  point  in  the  set  .  These  Dirichlet  re¬ 
gions  are  also  called  Voronoi  regions.  The  edges 


of  Voronoi  polygons  comprise  the  Voronoi  dia¬ 
gram,  see  figure  3.1.  The  idea  extends  naturally 
to  higher  dimensions. 


Figure  3.1  Voronoi  diagram  of  40  random  sites, 

Voronoi  diagrams  have  a  rich  mathematical  the¬ 
ory.  The  Voronoi  diagram  is  believed  to  be  one  of 
the  most  fundamental  constructs  defined  by  dis¬ 
crete  data.  Voronoi  diagrams  have  been  inde¬ 
pendently  discovered  in  a  wide  variety  of  disci¬ 
plines,  Computational  geometricians  have  a  keen 
interest  in  Voronoi  diagrams.  It  is  well  known 
•hat  Voronoi  diagrams  are  related  to  convex  hulls 
via  stereographic  projection.  Point  location  in  a 
Voronoi  diagram  can  be  performed  in  0(log(/,)) 
time  with  O(u)  storage  for  »  regions.  This  is  use¬ 
ful  in  solving  post-office  or  related  problems  in 
optimal  time.  Another  example  of  the  Voronoi 
diagram  which  occurs  in  the  natural  sciences  can 
be  visualized  by  placing  crystal  “seeds”  at  ran¬ 
dom  sites  in  3-space.  I.et  the  crystals  grow  at 
the  same  rate  in  all  directions.  When  two  crys¬ 
tals  collide  simply  stop  their  growth.  The  crys¬ 
tal  formed  for  each  site  would  represent  that  vol¬ 
ume  of  space  which  is  closer  to  that  site  than  to 
any  other  site.  This  would  effectively  construct 
a  Voronoi  diagram.  We  now  consider  the  role  of 
Voronoi  diagrams  in  Delaunay  triangulation. 

Deli  nit  ion.'Tlie  Delaunay  triangulation  of  a  point 
set  is  defined  as  the  dual  of  the  Voronoi  diagram 
of  the  set. 

The  Delaunay  triangulation  in  two  space  dimen¬ 
sions  is  formed  by  connecting  two  poims  if  and 
only  if  their  Voronoi  regions  have  a  common  bor¬ 
der  segment.  If  no  four  or  more  points  are  cocir¬ 
cular,  then  we  have  that  vertices  of  the  Voronoi 
arc  circumcentcrs  of  the  triangles.  This  is  true  be¬ 


cause  vertices  of  tbe  Voronoi  represent,  lotntions 
that  are  equidistant  to  three  (or  more)  sites.  Also 
note  that  from  the  definition  of  duality,  edges 
of  the  Voronoi  are  in  one-to-one  correspondence 
to  edges  of  the  Delaunay  triangulation  (ignoring 
boundaries).  Because  edges  of  the  Voronoi  di¬ 
agram  are  the  locus  of  points  equidistant,  to  two 
sites,  each  edge  of  the  Voronoi  diagram  is  perpen¬ 
dicular  to  the  corresponding  edge  of  the  Delau¬ 
nay  triangulation.  This  duality  extends  to  three 
dimensions  in  a  straightforward  way.  The  De¬ 
launay  triangulation  possesses  several  alternate 
characterizations  and  many  properties  of  impor¬ 
tance.  Unfortunately,  not  all  of  the  two  dimen¬ 
sional  characterizations  have  three-dimensional  ex¬ 
tensions.  To  avoid  confusion,  properties  and  algo¬ 
rithms  for  construction  of  two  dimensional  Delau¬ 
nay  triangulations  will  be  considered  first.  The 
romaincr  of  this  section  will  then  discuss  the  three- 
dimensional  Delaunay  triangulation. 

3.2  Properties  of  a  2-D  Delaunay  Triangulation 

(1)  Uniqueness.  The  Delaunay  triangulation  is 
unique.  This  assumes  that  no  four  sites  are  coeir- 
eular.  The  uniqueness  follows  from  (lie  unique¬ 
ness  of  the  Diriehlet  tessellation. 

(2)  The  circumcircle  criteria.  A  triangulation  of 
N  >  2  sites  is  Delaunay  if  and  only  if  the  ciivum- 
citcle  of  every  interior  triangle  is  point-free.  For 
if  this  was  not  true,  the  Voronoi  regions  associ¬ 
ated  with  tin*  dual  would  not  be  convex  and  the 
Diriehlet  tessellation  would  be  invalid.  Belated 
to  the  circumoircle  <  ritoria  is  the  incircle  test,  for 
four  points  as  shown  in  figures  3. 2-3. 3. 


Figure  3.2  Incircle  test  for  A  ABC  and  D  (true). 


Figure  3.3  Incircle  test  for  £AABC  and  D  (false). 


Tints  test.  is  t rue  if  point  D  lies  interior  to  tin-  <  ir- 
cutucirclc  of  A  ADC  which  is  equivalent  to  testing 
whether  LABC  LCD  A  is  less  than  or  greater 
than  L BCD  -j-  /.HAD.  Mon-  precisely  we  ha- 
that 

{<  180°  incireie  false 

180°  A.ll.C.D  cocircular 
>  180°  incircle  true 

(5.1) 

Since  interior  angles  of  the  quadrilateral  sum  to 
360°,  if  the  eircumeirclo  of  A  ABC  contains  D 
then  swapping  the  diagonal  edge  front  position 
,-1-C  into  B  —  D  guarantees  that  the  new  truing1, 
pair  satisfies  the  eircuincirclo  criteria,  Furtlie'r- 
more,  this  process  of  diagonal  swapping  is  local, 

i.e.  it  does  not.  disrupt,  the  Delautmvhood  of  any 
triangles  adjacent  to  the  quadrilateral. 

(8 ) Edge  circle  property.  A  triangulation  of  sites 
is  Delaunay  if  and  only  if  there  exists  some  circle 
passing  through  the  endpoints  of  each  and  every 
edge  which  is  point-free.  This  characterization  is 
very  aseftd  because  it  also  provides  a  mechanism 
for  defining  a  constrained  Delaunay  trinngulntion 
where  certain  edges  are  prescribed  a  priori.  A  tri- 
angulation  of  sites  is  n  constrained  Delaunay  tri¬ 
angulation  if  for  each  and  every  edge  of  the  mesh 
there  exists  some  circle  passing  through  its  end¬ 
points  containing  no  other  site  in  the  triangula¬ 
tion  which  is  visible  to  the  edge.  In  figure  3.1.  site 
d  is  not  visible  to  the  segment  a-c  because  of  the 
const  rained  edge  a-b. 


Figure  3.4  Constrained  Delaunay  triangulation. 
Site  <1  is  not  visible  to  a-c  due  to  constrained  seg¬ 
ment  a-b. 

(  t )h\>ui<m<jul<irity  property.  Delaunay  triangula- 
t  ion  maximizes  t  he  minimum  angle  of  the  triangii- 
iatioti.  For  t  his  reason  Delaunay  triuiigululiou  of¬ 
ten  called  the  MaxMin  Inaiigulahou.  This  prop¬ 
erty  is  also  locally  true  foi  all  adjacent  triangle 
pairs  which  form  a  convex  quadrilateral.  This  is 
the  basis  for  the  local  edge  swapping  algorithm  of 
Lawson  ( 1 1 ]  described  below. 


(5 )Minimum  Containment  Circle.  A  recent  result 
by  Rajiut  [12]  shows  that  the  Delaunay  triangula- 
tiott  minimizes  the  maximum  containment  circle 
over  the  entire  trinngulation.  The  containment 
circle  is  defined  as  the  smallest  circle  enclosing 
the  three  vertices  of  a  triangle.  This  is  identical 
to  die  circmm  hole  for  ac  ute  triangles  and  a  til¬ 
de  with  diameter  equal  to  the  longest,  side  of  t  he 
triangle  for  obtuse  triangle's  (see  figure  3.5). 


Figure  3.5  Containment  c  ircles  for  acute  and  ob¬ 
tuse  triangles. 

This  property  extends  to  n  dimensions.  Unfortu¬ 
nately,  the  result  docs  not  hold  lexicographically; 

(6)  A 'rarest  neighbor  property.  An  edge  formed 
by  joining  a  vertex  to  its  nearest  neighbor  is  an 
edge  of  the  Delaunay  trinngulation.  This  prop¬ 
erty  make's  Delaunay  triangulat  ion  a  powerful  too 
in  solving  the-  closest  proximity  pi  blem.  Note 
that  the  nearest  neighbor  edges  do  not  describe 
all  e'dge's  of  the  Delaunay  triangulation. 

(1) Minimal  roughness.  The  Delaunay  triangula¬ 
tion  is  a  minima!  roughness  tnangulatum  for  ar¬ 
bitrary  sets  of  scattered  data,  Kippa  [13].  Giver 
arbitrary  data  f,  at  all  wrtiee'x  of  t  he  tne'sli  and  n 
triangulation  of  these  points,  a  unique  pie'cewisi 
linear  interpolating  surface'  can  bo  constructed 
The'  Doiiuiuay  triangulation  has  the  property  that 
of  all  triangulatious  it  minimizes  t  he  roughness  ol 
this  surface  as  measured  by  the  following  Sobolev 
.semi-norm: 


This  is  a  interesting  result  as  it.  does  not  depone 
e»u  the  actual  form  of  the  data.  This  also  indi¬ 
cates  that  Delaunay  triangulation  approximate: 
well  those  functions  whic  h  minimize  this  Sobolev 
norm.  One  example  vvoulel  be  the  harmonic  func 
lions  satisfying  Laplace’s  equation  with  suitable 
boundary  conditions  which  minimize  exactly  t.hi: 
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norm.  In  a  later  section,  we  will  prove  that  a 
Delaunay  triangulatiou  guarantees  a  maximum 
principle  for  the  discrete  Laplacian  approxima¬ 
tion  (with  linear  elements). 

3.3  Algorithms  for  2-D  Delaunav  Trianmiiation 

We  now  consider  several  techniques  for  De¬ 
launay  triangulation  in  two  dimensions.  These 
methods  were  chosen  because  they  perform  opti¬ 
mally  in  rather  different  situations.  The  discus¬ 
sion  of  the  2-D  algorithms  is  organized  as  follows: 

(a)  Incremental  Insertion  Algorithms 

(i)  Bowyer  algorithm 

(ii)  Wavson  algorithm 

(iii)  Green  and  Sibson  algorithm 

(b)  Divide  and  Conquer  Algorithm 

(c)  Tanemura/Merriam  Algorithm 

(d)  Global  Edge  Swapping  (Lawson) 

It  should  be  pointed  out  that  there  appears  to  be 
some  confusion  in  the  CFD  literature  concerning 
the  Bowycr[14]  and  Watson[15]  algorithms.  What 
is  sometimes  described  as  Bowyer’s  algorithm  is 
actually  Watson’s  algorithm.  This  is  surprising 
since  both  the  Bowyer  and  Watson  algorithms  ap¬ 
peared  as  back-to-back  articles  in  the  same  jour¬ 
nal!  The  fundamental  difference  (as  we  will  see) 
is  that  the  Bowyer  algorithm  is  implemented  in 
the  Voronoi  plane  and  the  Watson  algorithm  is 
implemented  in  the  triangulatiou  plane. 

3.3a  Incremental  Insertion  Algorithms 

For  simplicity,  assume  that  the  site  to  be 
added  lies  within  a  bounding  polygon  of  the  ex¬ 
isting  triangulatiou.  If  we  desire  a  triangulatiou 
from  a  new  set  of  sites,  three  initial  phantom 
points  can  always  be  added  which  define  a  tri¬ 
angle  large  enough  to  enclose  all  points  to  be  in¬ 
serted.  In  addition,  interior  boundaries  are  usu¬ 
ally  temporarily  ignored  for  purposes  of  the  De¬ 
launay  triangulation.  After  completing  the  tri¬ 
angulation,  spurious  edges  arc  then  deleted  as  a 
postprocessing  step.  Incremental  insertion  algo¬ 
rithms  begin  by  inserting  a  new  site  into  an  exist¬ 
ing  Delaunay  triangulation.  This  int  reduces  the 
task  of  point  location  in  the  triangulation.  Some 
incremental  algorithms  require  knowing  whic  h  i  ri- 
anglc  the  new  site  falls  within.  Other  algorithms 
require  knowing  any  triangle  whose  circumcirclc 
contains  the  new  site.  In  either  case,  two  ex¬ 
tremes  arise  in  this  reguard.  Typical  mesh  adap¬ 
tation  and  refinement  algorithms  determine  the 
particular  cell  for  site  insertion  as  part  of  the 
mesh  adaptation  algorithm,  thereby  reducing  the 
burden  of  point  location.  In  the  o  ' her  extreme. 


initial  triangulations  of  randomly  distributed  sites 
usually  require  advanced  searching  techniques  for 
point  location  to  achieve  asymptotically  optimal 
complexity  0(N  log  iV).  Search  algorithms  based 
on  quad-tree  and  split-tree  data  structures  work 
extremely  well  in  this  case.  Alternatively,  search 
techniques  based  on  ‘‘walking”  algorithms  are  fre¬ 
quently  used  because  of  their  simplicity.  These 
methods  work  extremely  well  when  successively 
added  points  are  close  together.  The  basic  idea 
is  start  at  the  location  in  the  mesh  of  the  previ¬ 
ously  inserted  point  and  move  one  edge  (or  cell) 
at  a  time  in  the  general  direction  of  the  newly 
added  point.  In  the  worst  case,  each  point  inser¬ 
tion  requires  O(N)  walks.  This  would  result  in 
a  worst  case  overall  complexity  G{N2).  For  ran¬ 
domly  distributed  points,  the  average  point  inser¬ 
tion  requires  C(N*)  walks  which  gives  an  overall 
complexity  O(.V^).  Fo»-  many  applications  where 
successive  points  tend  to  be  close  together,  the 
number  of  walks  is  roughly  constant  and  these 
simple  algorithms  can  be  very  competitive.  Us¬ 
ing  any  of  these  techniques,  we  can  proceed  with 
the  insertion  algorithms. 

Bowyer  s  algorithm, 

The  basic  idea  in  Bowyer’s  algorithm  is  to 
insert  a  new  site  into  an  existing  Voronoi  diagram 
(for  example  site  Q  in  figure  3.6),  determine  its 
territory  (dashed  line  in  figure  3  6),  delete  any 
edges  completely  contained  in  the  territory,  then 
add  new  edges  and  reconfigure  existing  edges  in 
the  diagram.  The  following  is  Rowyor's  algorithm 
essentially  as  presented  by  Bowyer  (?co  referen  . 
(14]  for  complete  details): 

Algorithm:  Incremental  Delaunay  irianguiation, 
Bowyer  |14). 

Step  1.  Insert,  new  point  (site)  Q  into  the  Voronoi 
diagram. 

Step  Find  any  existing  vertex  in  the  Voronoi 
diagram  closer  to  the  new  point  than  to  its  foi cl¬ 
ing  points.  This  vertex  will  be  deleted  in  the  new 
Voronoi  diagram. 

Step  3.  Perform  tree  search  to  find  remaining  set 
of  delctable  vertices  V  that,  are  closer  to  the  new 
point  than  to  their  forming  points.  (In  figure  3.6 
this  would  be  the  set  {113,  iq,  v*> } ) 

Step  4.  Find  the  set  V  of  forming  points  corre¬ 
sponding  1 1  the  deletable  vertices.  In  figure  3.6, 
this  would  be  the  set  ps,7>r}- 

Step  5.  Delete  edges  of  the  Voronoi  which  can 
be  described  by  pairs  of  vertices  in  the  set  V  if 
both  forming  points  of  the  edges  to  be  deleted 
are  contained  in  P 
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Step  6.  Calculate  tlxe  new  vertices  of  the  Voronoi, 
compute  their  forming  points  and  neighboring  ver¬ 
tices,  and  update  the  Voronoi  data  structure. 


Figure  3.6  Voronoi  diagram  modified  by  Bowyer. 

Iinplementational  details  and  suggested  data 
structures  are  given  in  the  paper  by  Bowyer. 

Watson's  algorithm 

Implementation  of  the  "Watson  [15]  algorithm 
is  relatively  straightforward.  The  first  step  is  to 
insert  a  new  site  into  an  existing  Delaunay  trian¬ 
gulation  and  to  find  any  triangle  (the  root)  such 
that  the  new  site  lies  interior  to  that  triangles  cir- 
cumeircle.  Starting  at  the  root,  a  tree  search  is 
performed  to  find  all  triangles  with  circumcircle 
containing  the  new  site.  This  is  accomplished  by 
recursively  checking  triangle  neighbors.  (The  re¬ 
sulting  set  of  deletable  triangles  violating  the  cir- 
cuincircle  criteria  is  independent  of  the  starting 
root.)  Removal  of  the  deletable  triangles  exposes 
a  polygonal  cavity  surrounding  site  Q  with  all  the 
vertices  of  the  polygon  visible  to  site  Q.  The  in¬ 
terior  of  the  cavity  is  then  rctriangulatcd  by  con¬ 
necting  the  vertices  of  the  polygon  to  site  Q,  see 
figu.c  3.7(b).  This  completes  the  algorithm.  A 
thorough  account  of  Watson’s  algorithm  is  given 
by  Baker  [16]  where  he  considers  issues  associated 
with  constrained  triangulations. 

Algorithm:  Incremental  Delaunay  triangulation, 
Watson  [15]. 

Step  1.  Insert  new  site  Q  into  existing  Delaunay 
tri  angulation. 

Step  2.  Find  any  triangle  with  circumcircle  con¬ 
taining  site  Q. 

Step  3.  Perform  tree  search  to  find  remaining  set 
of  deletable  triangles  with  circumcircle  containing 
site  Q. 

Step  4'  Construct  list  of  edges  associated  with 
deletable  triangles.  Delete  all  edges  from  the  list, 
that  appear  more  that  once. 


Step  5.  Connect,  remaining  edges  to  site  Q  and 
update  Delaunay  data  structure. 


Figure  3.7  (a)  Delaunay  triangulation  with  site 
Q  added,  (b)  Triangulation  after  deletion  of  in¬ 
valid  edges  and  reconnection. 

Green  and  Sibson  algorithm 

The  algori'jnn  due  to  Green  and  Sibson  [17] 
is  very  similar  to  the  Watson  algorithm.  The  pri¬ 
mary  difference  is  the  use  of  local  edge  swapping 
to  reconfigure  the  triangulation.  The  first  step  is 
location,  i.e.  find  the  triangle  containing  point 
Q.  Once  this  is  done,  three  edges  are  then  cre¬ 
ated  connecting  Q  to  the  vertices  of  this  triangle 
as  shown  in  figure  3.8(a).  If  the  point  falls  on  an 
edge,  then  the  edge  is  deleted  and  four  edges  are 
created  connecting  to  vertices  of  the  newly  cre¬ 
ated  quadrilateral.  Using  the  circumcircle  criteria 
it.  can  be  shown  that  the  newly  created  edges  (3 
or  4)  are  automatically  Delaunay.  Unfortunately, 
some  of  the  original  edges  are  now  incorrect.  We 
need  to  somehow  find  all  ‘‘suspect”  edges  which 
could  possibly  fail  the  circle  test.  Given  that  this 
can  be  done  (described  below),  each  suspect  edge 
is  viewed  as  a  diagonal  of  the  quadrilateral  formed 
from  the  two  adjacent  triangles.  The  circumcircle 
test  is  applied  to  either  one  of  the  two  adjacent 
triangles  of  the  quadrilateral.  If  the  fourth  point 
of  the  quadrilateral  is  interior  to  this  circumcircle. 
the  suspect  edge  is  then  swapped  as  shown  in  fig¬ 
ure  3.8(b),  two  more  edges  then  become  suspect. 
At  any  given  time  we  can  immediately  identify  all 
suspect  edges.  To  do  this,  first  consider  the  sub¬ 
set  of  all  triangles  which  share  Q  as  a  vertex.  One 
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can  guarantee  at  all  times  that  all  initial  edges  in¬ 
cident  to  Q  are  Delaunay  and  any  edge  made  inci¬ 
dent  to  Q  by  swapping  must  be  Delaunay.  There¬ 
fore,  we  need  only  consider  the  remaining  edges 
of  this  subset  which  form  a  polygon  about  Q  as 
suspect  and  subject  to  the  incircle  test.  The  pro¬ 
cess  terminates  when  all  suspect  edges  pass  the 
circumcircle  test. 


Figure  3.8  (a)  Inserting  of  now  vertex,  (l>)  Swap¬ 
ping  of  suspect  edge. 

The  algorithm  can  be  summarized  as  follows; 

Algorithm:  Incremental  Delaunay  Tri angulation. 
Green  and  Sibson  [17] 

Step  1.  Locate  existing  cell  enclosing  point  Q. 
Step  2.  Insert  site  and  connect  to  3  or  4  surround¬ 
ing  vertices. 

Step  3.  Identify  suspect  edges. 

Step  4-  Perform  edge  swapping  of  all  suspect, 
edges  failing  the  incircle  test. 

Step  5.  Identify  new  suspect  edges. 

Step  0.  If  new  suspect  edges  have  been  created, 
go  to  step  3. 

The  Green  and  Sibson  algorithm  can  be  im¬ 
plemented  using  standard  recursive  programming 
techniques.  The  heart  of  the  algorithm  is  the  re¬ 
cursive  procedure  which  would  take  the  following 
form  for  the  config”ration  shown  in  figure  3.9: 


procedure  swap[  vq,  iq,  u2,  u3,  edges] 
if(mc»rdc[u,,«i,W2,W3]  =  TRUE)then 

call  recoiifig-edgcs[u9,  u, ,  <>2,  v3)  edges ] 
call  swap[tiq,  u\,  jq,  u2,  edges] 
call  swap[ i'<_ ,  i>2)  Vo,  i'-j,  edges ] 
endif 

endproccdure 

This  example  illustrates  an  important,  point. 
The  nature  of  Delaunay  triangulation  guarantees 
that  any  edges  swapped  incident  to  Q  will  be  final 
edges  of  the  Delaunay  triangulatiou.  This  means 
that  we  need  only  consider  forward  propagation 
in  the  recursive  procedure.  In  a  later  section, 
we  will  consider  incremental  insertion  and  edge 
swapping  for  generating  non-Delaunay  triangula¬ 
tions  based  on  other  swapping  criteria.  This  al¬ 
gorithm  can  also  be  programmed  recursively  but 
requires  backward  propagation  in  the  recursive  im¬ 
plementation.  For  the  Delaunay  triangulation  al¬ 
gorithm,  the  insertion  algorithm  would  simplify 
to  the  following  three  steps: 

Recursive  Algorithm:  Incremental  Delaunay 

TViangulation,  Green  and  Sibson 

Step  1.  Locate  existing  cell  enclosing  point  Q. 

Step  2.  Insert  site  and  connect  to  surrounding 

vertices. 

Step  3.  Perform  recursive  edge  swapping  on  newly 
formed  cells  (3  or  4). 


Figure  3.9  Edge  swapping  with  forward  propa¬ 
gation. 

3.3b  Dividc-nnd-Conciucr  Algorithm 

In  this  algorithm,  the  sites  arc  assumed  to  be 
prespecified.  The  idea  is  to  partition  the  cloud  of 
points  T  (sorted  along  a  convenient  axis)  into  left 
(L)  and  right  (Ii)  half  planes.  Each  half  plane 
is  then  recursively  Delaunay  triangulated.  The 
two  halves  must  then  be  merged  together  to  form 
a  single  Delaunay  triangulation.  Note  that  we 
assume  that  the  points  have  been  sorted  along 
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the  x-axis  for  purposes  of  the  following  discussion 
(this  can  he  done  with  0(Arlog  Ar)  complexity). 

Algorithm:  Delaunay  Triaugulation  via  Dividc- 
and-Conquer 

Step  1.  Partition  T  into  two  subsets  7’/,  and  T p 
of  nearly  equal  si/e. 

Step  2.  Delaunay  triangulate  2’/,  and  Tp  recur¬ 
sively. 

Step  3.  Merge  77,  and  Tp  into  a  single  Dehuinay 
triangulation. 


Figure  3.10  Triangulated  subdivisions. 


Figure  3.11  Triangulation  after  merge. 

The  only  difficult  step  in  the  divide-and-conqucr 
algorithm  is  the  merging  of  the  left  and  right  tri- 
angulations.  The  process  is  simplified  by  noting 
t  wo  properties  of  the  merge: 

(1)  Only  cross  edges  (L-ll  or  H-h)  arc  created  in 
the  merging  process.  Since  vertices  are  neither 
added  or  deleted  in  the  merge  process,  the  need 
for  a  new  R-R  or  L-L  edge  indicates  that  the  origi¬ 
nal  right  or  left  triangulation  was  defective.  (Note 
that  the  merging  process  will  require  the  deletion 
of  edges  L-L  and/or  R-R.) 

(2)  Vertices  with  minimum  (maximum)  y  value  in 
the  left,  and  right  triangulations  always  connect 


as  cross  edges.  This  is  obvious  given  that  the 
Delaunay  triangulation  produces  the  convex  : 
of  the  point  cloud. 

Given  these  properties  we  now  outline  the  “ris¬ 
ing  bubble”  [1]  merge  algorithm.  This  algorithm 
produces  cross  edges  in  ascending  y-oidcr.  The 
algorithm  begins  by  forming  a  cross  edge  by  con¬ 
necting  vertices  of  the  loft  and  right  triangula¬ 
tions  with  minimum  y  value  (property  2).  This 
forms  the  initial  cross  edge  for  the  rising  bubble 
algorithm.  More  generally  consider  the  situation 
in  which  we  have  a  cross  edge  between  A  and  B 
and  all  edges  incident  to  the  points  A  and  B  with 
endpoints  above  the  half  plane  formed  by  a  line 
passing  through  A  -  B,  see  figure  3.12. 


Figure  3.12  Circle  of  increasing  radius  in  rising 
bubble  algorithm. 

This  figure  depicts  a  continuously  transformed 
circle  of  increasing  radius  passing  through  the 
points  A  and  B.  Eventually  the  circle  increases 
in  size  and  encounters  a  point  C  from  the  left  or 
right  triangulation  (in  this  case,  point  C  is  in  the 
left  triangulation).  A  new  cross  edge  (dashed  line 
in  figure  3.12)  is  then  formed  by  connecting  this 
point  to  a  vertex  of  .4  -  13  in  the  other  half  trian¬ 
gulation.  Given  the  new  cross  edge,  the  process 
can  then  be  repeated  and  terminates  when  the 
top  of  the  two  meshes  is  reached.  The  deletion 
ot  L-L  or  R-R  edges  can  take  place  during 
or  after  the  addition  of  the  cross  edges.  Prop 
erly  implemented,  the  merge  can  be  carried  oul 
in  linear  time,  O(N).  Denoting  T(N)  as  the  to 
t.al  running  time,  step  2  is  completed  in  approxi 
mately  2T(N/2).  Thus  the  total  running  time  i: 
described  by  the  recurrence  T(N)  —  2T(N/2)  7 
O(N)  =  0(N  log  N). 

3.3c  Tancmura/Merriam  Algorithm 

Another  algorithm  for  performing  Debiuna; 
triangulation  is  the  advancing  front  method  de 
veloped  by  Tanemma,  Ogawa,  and  Ogita  [18]  am 
later  rediscovered  by  Merriam  (10).  Here  the  ide 
is  to  start  with  a  known  boundary  edge  and  fori 
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a  now  triangle  bv  joining  both  endpoints  to  one  of 
the  interior  points.  This  may  generate  up  to  two 
additional  edges,  providing  they  aren't  already 
part  of  another  triangle.  After  all  the  boundary 
edges  have  been  incorporated  into  triangles,  the 
new  edges  will  appear  to  be  a  (somewhat  ragged) 
boundary.  This  moving  boundary  is  often  called 
an  advancing  front.  The  process  continues  until 
the  front  vanishes.  The  problem  here  is  to  make 
the  triangulation  Delaunay.  This  can  be  done 
by  taking  advantage  of  the  incircle  property;  the 
drcumoircle  of  a  Delaunay  triangle  contains  no 
other  points.  This  allows  the  appropriate  point 
to  be  selected  iteratively  as  shown  in  Fig.  3.13. 
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Figure  3.13  A  straightforward  iteration  proce¬ 
dure  selects  the  node  which  generates  the  small¬ 
est  circumcircle  for  a  given  edge.  The  absence  of 
nodes  inside  the  circumcircle  establishes  conver¬ 
gence . 

The  iteration  begins  by  selecting  any  node  which 
is  on  the  desired  side  of  the  given  edge.  If  there 
arc  no  such  nodes,  the  given  edge  is  part  of  a  con¬ 
vex  hull.  Next,  the  circumcircle  is  constructed 
which  passes  through  the  edge  endpoints  and  the 
selected  node.  Finally,  check  for  nodes  inside  this 
circle.  If  there  are  any,  replace  the  selected  node 
with  the  node  closest  to  the  circumecntor  and  re¬ 
peat.  the  process.  When  the  circumcircle  is  empty 
of  nodes,  connect,  the  edge  endpoints  to  the  se¬ 
lected  node. 

3.3d  Delaunay  Trianuulation  Via  Edge  Swapping 

This  algorithm  due  to  Lawson  [11]  assumes 
that  a  triangulation  exists  (not  Delaunay)  then 
makes  it  Delaunay  through  application  of  edge 
swapping  such  that  the  equiangularity  of  the  tri¬ 
angulation  increases  The  equiangularity  of  a  tri- 
angulation,  A(T),  is  defined  as  the  ordering  of 
angles  A(T)  =  [ori,  ...,  such  that 

(v,  <  otj  if  i  <  j.  We  write  A(T*)  <  A(T)  if 
ev*  <  aj  and  or*  -  a,  for  1  <  i  <  j.  A  triangu¬ 
lation  T  is  globally  equiangular  if  A(T*)  <  A(T) 
for  all  triangulations  T *  of  the  point  set.  Law¬ 
son’s  algorithm  examines  all  interior  edges  of  the 
mesh.  Each  of  these  edges  represents  the  diag¬ 
onal  of  the  quadrilateral  formed  by  the  union  of 


the  two  adjacent  triangles.  In  general  one  must 
first  cheek  if  the  quadrilateral  is  convex  so  that 
a  potential  diagonal  swapping  can  place  without 
edge  crossing.  If  the  quadrilateral  is  convex  then 
the  diagonal  position  is  chosen  which  optimizes  a 
local  criterion  (in  this  case  the  local  cquiangular- 
ity).  This  amounts  to  maximizing  the  minimum 
angle  of  the  two  adjacent  triangles.  Lawson’s  al¬ 
gorithm  continues  until  the  mesh  is  locally  opti¬ 
mized  and  locally  equiangular  everywhere.  It  is 
easily  shown  that  the  condition  of  local  equiangu¬ 
larity  is  equivalent,  to  satisfaction  of  the  incirele 
test  described  earlier.  Thei°fore  a  mesh  which 
is  locally  equiangular  everywhere  is  a  Delaunay 
triangulation.  Note  that  each  new  edge  swap¬ 
ping  (triangulation  T*)  insures  that  the  global 
equiangularity  increases  A(T*)  >  A(T).  Because 
the  triangulation  is  of  finite  dimension,  this  guar¬ 
antees  that  the  process  will  terminate  in  a  finite 
number  of  steps. 

Iterative  Algorithm:  Triangulation  via  Law¬ 
son’s  Algorithm 
swap  edge  =  true 
Whilc(swapedge)do 
swapedge  =  false 
Do  ( all  interior  edges ) 

If  ( adjacent  triangles  form  convex  quad)  then 
Swap  diagonal  to  form  T * . 

If  ( optimization  enteria  satisfied) then 
T  =  T* 

swapedge  =  true 
Endlf 

Endlf 

EndDo 

EudWhilc 

When  Lawson’s  algorithm  is  used  for  construct¬ 
ing  Delaunay  triangulations,  the  test  for  quadri¬ 
lateral  convexity  is  not  needed.  It  can  be  shown 
that  nonconvex  quadrilaterals  formed  from  tri¬ 
angle  pairs  never  violate  the  circumcircle  test. 
When  more  general  optimization  criteria  is  used 
(discussed  later),  the  convex  check  must  be  per¬ 
formed. 

3.4  Other  2-D  Triangnlation  Algorithms 

In  this  section,  other  algorithms  which  do 
not  necessarily  produce  Delaunay  triangulations 
are  explored. 

The.  MinMax  Triangulation 

As  Babuska  and  Aziz  [22]  point  out,  from  the 
point  of  view  of  finite  elements  the  MaxMin  (De¬ 
launay)  triangulation  is  not  essential.  What  is 
essential  is  that  no  angle  be  too  dose  to  180®.  In 
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other  words,  triangulatioiis  which  minimize  the 
maximum  angle  are  more  desirable.  These  tri¬ 
angulations  are  referred  to  as  MinMax  triangula¬ 
tions.  One  way  to  generate  a  2-D  MinMax  tri¬ 
angulations  is  via  Lawson’s  edge  swapping  algo¬ 
rithm.  In  the  case,  the  diagonal  position  for  con¬ 
vex  pairs  of  triangles  is  chosen  which  minimizes 
the  maximum  interior  angle  for  both  triangles. 
The  algorithm  is  guaranteed  to  converge  in  a  fi¬ 
nite  number  of  steps  using  arguments  similar  to 
Delaunay  triangulation.  Figures  3.14  and  3.15 
present  a  Delaunay  (MaxMin)  and  MinMax  tri¬ 
angulation  for  100  random  points. 


Figure  3.14  Delaunay  Triangulation. 


Figure  3.15  MinMax  Triangulation. 

Note  that  application  of  local  MinMax  optimiza¬ 
tion  via  Lawson’s  algorithm  may  only  result  in 
a  mesh  which  is  locally  optimal  and  not  neces¬ 
sarily  at  a  global  minimum.  Attaining  a  globally 
optimal  MinMax  triangulation  is  a  much  more 
difficult  task.  The  best  algorithm  to  present  date 
(Edolsbrunnor,  Tan,  and  Waupotitsch  [23])  has  a 
high  complexity  of  0(n~  log  «).  Wiltbcrger  [24] 
has  implemented  a  version  of  the  Green  and  Sib- 
son  algorithm  [17]  which  has  been  modified  to 


produce  locally  optimal  MinMax  triangulations 
using  incremental  insert  ion  and  local  edge  swap¬ 
ping.  The  .algorithm  is  implemented  using  re¬ 
cursive  programming  with  complete  forward  and 
backward  propagation  (contrast  figures  3.1G  and 
3.9).  This  is  a  necessary  step  to  produce  locally 
optimized  meshes. 


Figure  3.16  Edge  swapping  with  forward  and 
backward  propagation  in  Wiltbcrger  algorithm. 

The  MinMax  triangulation  has  proven  to  be  very 
useful  in  CFD.  Figure  3.17  shows  the  Delaunay 
triangulalion  near  the  trailing  edge  region  of  an 
airfoil  with  extreme  point  clustering. 


Figure  3.17  Delaunay  triangulation  near  t  railing 
edge  of  airfoil. 

Upon  first  inspection,  the  mesh  appears  flawed 
near  the  trailing  edge  of  the  airfoil.  Further  in¬ 
spection  and  extreme  magnification  near  the  trail 
edge  of  the  airfoil  (figure  3.18)  indicates  that  the 
grid  is  a  mathematically  correct  Delaunay  tri¬ 
angulation.  Because  the  Delaunay  triangulation 
does  not  control  the  maximum  angle,  the  cells 
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near  the  trailin'?;  edge  luv*’e  angles  approaching 
180°.  The  presence  of  nearly  collapsed  triangles 
leaves  considerable  doubt  as  to  the  accuracy  of 
any  numerical  solutions  computed  in  the  trailing 
edge  region. 


Figure  3-18  Extreme  closcup  of  Delaunay  trian¬ 
gulation  near  trailing  edge  of  airfoil. 

Edge  swapping  based  on  the  MinMax  criteria  via 
Lawson’s  algorithm  or  incremental  insertion  us¬ 
ing  the  Wiltborger  algorithm  produce  the  desired 
result  as  shown  in  figure  3.19. 


Figure  3.19  MinMax  triangulation  near  trailing 
edge  of  airfoil. 

The  Giecdy  Triangulation 

A  greedy  method  is  one  that  never  undoes 
what  it  did  earlier.  The  greedy  triangulation  con¬ 
tinually  adds  edges  compatible  with  the  current 


triangulation  (edge  crossing  not  allowed)  until  the 
triangulation  is  complete,  i.e.  Eider's  formula  is 
satisfied.  One  objective  of  a  triangulation  might 
be  to  choose  a  set  of  edges  with  shortest  total 
length.  The  best  that  the  greedy  algorithm  can 
do  is  adopt  a  local  criterion  whereby  only  the 
shortest  edge  available  at  that  moment  is  con¬ 
sidered  for  addition  to  the  current  triangulation. 
(This  does  not  lead  to  a  triangulation  with  short¬ 
est.  total  length.)  Note  that  greedy  triangulation 
easily  accommodates  constrained  triangvdations 
containing  interior  boundaries  and  a  nonconvex 
outer  boundary.  In  this  case  the  boundary  edges 
are  simply  listed  first  in  the  ordering  of  candidate 
edges.  The  entire  algorithm  is  outlined  below. 

Algorithm:  Greedy  IViangulation 
Step  1.  Initialize  triangulation  T  as  empty. 

Step  Compute  (”)  candidate  edges. 

Step  3.  Order  pool  of  candidate  edges. 

Step  Jf.  Remove  current  edge  es  from  ordered 
pool. 

Step  5.  If(  e#  docs  not  intersect  edges  ofT  )  add 
es  to  T 

Step  6.  If (Euler’s  formula  not.  satisfied)  go  to 

Sfep  4. 


Figure  3.20  Greedy  'IViangulation. 

Figures  3.14  and  3.20  contrast  the  Delaunay 
and  greedy  algorithm.  The  lack  of  angle  con¬ 
trol  is  easily  seen  in  the  greedy  triangulation. 
The  greedy  algorithm  suffers  from  both  high  run¬ 
ning  time  as  well  as  storage.  In  fact  a  naive  im¬ 
plementation  of  Step  5.  leads  to  an  algorithm 
with  0(N 3)  complexity.  Efficient  implementation 
techniques  are  given  in  Gilbert  [25]  with  the  result 
that,  the  complexity  can  be  reduced  to  0(N~  log  N) 
with  0(N2)  storage. 
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Data  Dependent  'lYiawjvlation 

Unlike  mesh  adaptation,  <\  data  dependent 
triaugulation  assumes  t  hat  the  number  and  posi- 
tion  of  vertices  is  fixed  and  unchanging.  Of  all 
poss'ble  triangulations  of  these  vertices,  the  goal 
is  to  find  the  best  triangulation  under  data  depen¬ 
dent.  constraints.  In  Nira,  Levin,  and  Rippa  [26], 
they  consider  several  data  dependent  constraints 
together  with  piecewise  lineal'  interpolation.  In 
order  to  determine  if  a  new  mesh  is  “better"  than 
a  previous  one,  a  local  cost  function  is  defined  for 
each  interior  edge.  Two  choices  which  prove  to  be 
particularly  effective  me  the  JND  (Jump  in  Nor¬ 
mal  Derivatives)  and  the  ABN  (Angle  Between 
Normals).  Using  their  notation,  consider  an  inte¬ 
rior  edge  with  adjacent,  triangles  7j  and  T2.  Let 
P(.i\t/)i  and  P(x,y)i  be  the  linear  interpolation 
polynomials  in  7j  and  T2  respectively: 

Pi (.i;,;/)  --  npr  +  ftiy  +  ci 

P2(.r,  y)  =  a-j.v  +  b2y  +  cs 

The  JND  cost  function  measures  the  jump  in  nor¬ 
mal  derivatives  of  P\  and  P2  across  a  common 
edge  with  normal  components  nr  and  ntJ. 

$(fr,(’)  ~  |nT(rt,  -  a?)  +  »,,(b i  -  b-j)\y 

(JND  cost  function) 
The  ABN  measures  the  acute  angle  between  the 
two  normals  formed  from  the  two  planes  Pt  and 
P2.  Again  using  the  notation  of  [26]: 

,v(/r,c)  =  0  =  cos”1  (A) 


j  _  ci  i  <r  2  +  b\lh>  1 

'/(«{  +  b'j  4-  1  )(nf  -I-  bt  ■{■  1) 

(A13N  cost  function) 
The  next,  step  is  to  construct  a  global  measure  of 
those  cost  functions.  This  measure  is  required  to 
decrease  for  each  legal  edge  swapping.  This  in 
surcs  that,  the  edge  swapping  process  terminates. 
The  simplest  measures  are  the  / 1  and  l  >  norms: 

rt.(/r)  -  J2 

cdyt* 

f?2(/r)  =  Y,  *(/r. **)2 

Recall  that  a  Delaunay  triangulation  would  result 
if  the  cost  function  is  chosen  which  maximizes  the 
minimum  angle  between  adjacent  triangles  (Law- 
son’s  algorithm).  Although  it,  would  be  desirable 
to  obtain  a  global  optimum  for  all  cost  functions, 


this  could  be  very  costly  in  many  cases.  An  d- 
ternale  strategy  is  to  abandon  the  pursuit  of  a 
globally  optimal  triaugulation  in  favor  of  a  lo¬ 
cally  optimal  tri«inguhition.  Once  again  Lawson's 
algorithm  is  used.  Note  that  in  using  Lawson’s  al¬ 
gorithm,  we  require  that  the  global  measure  de¬ 
crease  at  each  edge  swap.  This  is  not  as  simple 
as  before  since  each  edge  swap  can  have  an  effect 
on  other  surrounding  edge  cost,  functions,  Never¬ 
theless,  this  domain  of  influence'  is  very  small  and 
easily  found. 

Iterative  Algorithm:  Data  Dependent  TViau- 
gulation  via  Modified  Lawson’s  Algorithm 
swapedge  =  true 
Whilc(s\vapedgc)do 
swapedge  *  false 
Do  (all  interior  edges) 

If  ( adjacent  triangles 

form  convex  quadrilatcrat)i\n'u 
Sump  diagonal  to  form  7”. 

If  (P(/r-)  </?(/r))then 

T  —  T* 

swapedge  =  true 
Endlf 
Etullf 
EiulDo 
EndWliilo 

Edge  swapping  only  occurs  when  /?(//••)  <  R[f  r) 
which  guarantees  that  the  method  terminates  in 
a  finite  number  of  steps.  Figures  3.14  and  3.21 
plot  tlie  Delaunay  trianguiation  of  100  random 
vertices  in  a  unit  square  and  piecewise  linear  con¬ 
tours  of  (1  -f-  tanh(9i/  —  9;r))/9  on  this  mesh.  The 
exact  solution  consists  of  straight  line  contours 
with  unit  slope. 


Figure  3.21  Piecewise  Linear  Interpolation  of 
(1  +  t,anh(9,y  —  9;r))/9. 


T n  figures  3.22  and  3.23  the  data  dependent  tri- 
angulation  and  solution  contours  using  the  JND 
criteria  and  /]  measure  suggested  in  [2(>]  are  plot¬ 
ted. 


Figure  3.22.  Data  Dependent  TYiangulntion. 


Figure  3.23  Piecewise  Linear  Interpolation  of 
( 1  4-  tiinh(9t/  -  9.r))/9. 

Note  that  tin  t  rial  gulat  ions  obtained  from  this 
method  are  not  globally  optimal  and  highly  de¬ 
pendent  on  the  order  in  which  edges  are  ac<  eased. 
Several  possible  ordering  strategies  are  mentioned 
in  (27|. 

3.5  2-D  Sieiner  Triaugnlations 

Definition:  A  Steiner  triangulation  is  any  tri- 
angulatien  that  adds  additional  sites  to  an  exist¬ 
ing  t  riangulation  to  improve  some  measure  of  grid 
quality. 

Technically  speaking,  the  method  of  advancing 
front  grid  generation  discussed  by  Professors  Mor¬ 
gan  and  Ldlmer  in  those  notes  would  be  a  spe¬ 
cial  type  of  Steiner  triangulation.  The  insertion 
algorithms  described  earlier  also  provide  a  sim¬ 
ple  mctL.nism  lor  generating  Steiner  triangula- 
tions.  Holmes  [28]  demonstrated  the  feasibility  of 


inserting  sites  at  circumcentersof  Delaunay  trian¬ 
gles  into  an  existing  2-D  triangulation  to  improve 
measures  of  grid  quality,  This  has  the  desired 
effect  of  placing  the  new  site  in  a  position  that, 
guarantees  that  no  other  site  in  the  triangulation 
can  lie  closer  that  the  radius  of  the  cireumcirclc. 
see  figure  3.24.  In  a  loose  sense,  the  new  site 
is  placed  ns  far  away  from  other  nearby  sites  as 
conservatively  possible. 
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Figure  3.24  Inserting  site  at.  cimiincentcr  of  acute 
and  obtuse  triangles. 

Warren  <•/  a/ [29]  and  Anderson  [30]  further  demon¬ 
strated  the  utility  of  this  type  of  Steiner  trian¬ 
gulation  in  the  generation  and  adaptive  refine¬ 
ment.  of  2-D  meshes.  The  algorithm  developed 
by  Wiltberger  [24]  also  permits  Steiner  Iriaugu- 
lations  based  on  either  MinMax  or  MaxMin  (De¬ 
launay)  insertion.  Only  in  the  latter  case  is  the 
insertion  at  triangle  eimuncenters  truly  juslifi- 
ablc  The  paragraphs  below  give  an  expanded 
discussion  of  2-D  Steiner  triangulation. 

Steiner  Grid  Generation 

The  2-D  Steiner  point  grid  generation  algo¬ 
rithm  described  in  [28,29,30]  consists  of  the  fol¬ 
lowing  steps.  Tbe  first  step  is  the  Delaunay  trian¬ 
gulation  of  the  boundary  data.  Usually  three  or 
four  points  are  placed  in  the  far  field  with  convex 
hull  enclosing  all  the  boundary  points.  Starting 
with  a  triangulation  of  these  points,  sites  corre¬ 
sponding  to  boundary  curves  arc  incrementally 
inserted  using  Watson’s  algorithm  in  [28,29,30] 
and  Green  and  Sibson’s  algorithm  in  [17]  as  shown 
in  figure  3.25.  The  initial  triangulation  u  \s  not 
guarantee  that  all  boundary  edges  are  members  of 
the  triangulation.  This  can  be  remedied  in  a  vari¬ 
ety  of  ways.  One  technique  adds  additional  points 
to  the  triangulation  so  as  to  guarantee  that  the 
resulting  Delaunay  triangulation  contains  all  the 
desired  boundary  edges,  see  reference  [10].  An¬ 
other  approach  performs  local  edge  swapping  so 
as  to  produce  a  constrained  Delaunay  triangu¬ 
lation  which  guarantees  that  all  boundary  edges 


iitf  actual  «'<!i'ps  of  tin*  triangulation. 
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Figure  3.2G  Initial  triuugulatioti  of  boundary 
points. 

In  either  event,  tin'  boundary  edges  are  marked  so 
that  they  cannot  In'  removed  as  the  triangulation 
is  refined.  The  algorithms  desoihed  in  [28,29,30) 
interrogate  triangles  in  an  arbitrary  onlor  (this 
makes  tlm  triaugulntion  lionuuique).  Tlu*  user 
must  specify  some  measure  of  quality  for  triangle 
roliuoinont  (aspeet  ratio,  area,  contniument  eirele 
radius,  for  example)  and  a  threshold  value  for  the 
measure.  If  a  triangle  fails  to  meet  the  threshold 
value,  the  triaugulatiou  is  refilled  by  placing  a 
new  site  at  the  cireum •■enter  >«f  the  failed  trian¬ 
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taken  to  insure  that  measures  are  chosen  which 
are  guaranteed  to  he  reduced  when  the  refine¬ 
ment  takes  place.  Using  thresholding  in  this  way 
does  not  give  the  user  direct,  wiitvol  over  the  ac¬ 
tual  number  of  triangle  generated  in  the  process 
of  Steiner  refinement.  Wiltbcrgei  takes  a  dif¬ 
ferent  approach  by  maintaining  a  th/mimic  heap 
data  structure  of  the  quality  measure.  (Heap 
structures  are  a  very  efficient  way  of  keeping  a 
sorted  list  of  entries  with  insertion  and  query  time 
O(log.Y)  for  .V  entries.)  The  triangle  with  the 
largest  value  of  the  specified  measure  will  be  lo¬ 
cated  at,  tlie  top  of  the  heap  at  all  times  dur¬ 
ing  the  triangulation.  This  makes  implementa¬ 
tion  of  a  Steiner  triangulation  which  minimizes 
the  maximum  value  of  the  measure  very  efficient 
(and  unique).  In  this  implementation,  the  user 
call  either  specify  the  number  of  triangles  to  be 
generated  or  a  threshold  value  of  the  measure. 
Note  that  multiple  measures  can  be  refined  lex¬ 
icographically.  Figure  3.26  shows  a  Steiner  tri- 


mignlation  using  the  Wiltborger  algorithm  with 
MnxMin  insertion  and  refinement  based  on  max¬ 
imum  aspect  ratio. 


Figure  3.26  Steiner  triangulation  with  sites  in¬ 
serted  at  cireunicenters  to  reduce  maximum  cell 
aspect  ratio. 
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Figure  3.27  Steiner  triangulation  of  Texas  coast 
and  the  Gulf  of  Mexico. 


For  instance,  figure  3.27  shows  a  Steiner  triangu- 
lat  ion  of  the  Texas  coast  and  Gulf  of  Mexico. 

3.6  Three-Dimensional  Triangnlations 

The  Delaunay  triangulation  extends  naturally 
into  three  dimensions  as  the  geometric  dual  of  the 
3-D  Voronoi  diagram.  The  Delaunay  triangula¬ 
tion  in  3-D  can  be  characterized  as  the  unique 
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triangulation  such  that  the  circumsphore  passing 
through  ft  four  vortices  of  any  tetrahedron  must 
not  c<"‘'  jii  any  other  point  in  the  trinngulatiou. 
A'  ’■  the  2-D  case.  the  3-D  Delaunay  trinnguln- 
t ton  has  the  property  that  it  minimizes  the  max¬ 
imum  containment,  sphere  (globally  but  not  lo¬ 
cally).  In  two  dimensions,  it  can  be  shown  that 
a  mesh  entirely  comprised  of  acute  triangles  is 
automatically  Deimmav.  To  prove  this,  consider 
an  adjacent  triangle  pair  forming  a  tjuadrilalernl. 
By  swapping  the  position  of  the  diagonal  it.  is 
easily  shown  that  the  minimum  angle  always  in¬ 
creases.  Ha jiui  (12)  shows  the  natural  extension 
of  this  idea  to  three  or  more  space  dimensions, 
lie  delines  a  "self-eentered”  simplex  in  It.'1  to  be 
a  simplex  which  has  the  circmncenter  of  its  cir- 
cutnspheie  interior  to  the  simplex.  In  two  dimen¬ 
sions,  acute  triangles  are  self-eentered  and  obtuse 
triangles  are  not.  Hajan  shows  that  a  triangnln- 
tion  entirely  composed  of  self-centered  simplires 
in  RJ  is  automaticnlly  Delauuay. 

d.Oa  3-1)  Bowycr  Mid  Watson  Algorithms 

The  algorithms  of  Bowyor  (11)  and  Watson 
(15)  extend  naturally  to  three  dimensions  with  es- 
t  inia.ed  complexities  of  0(.VS/')  and  ()( .Y^s)  for 
,V  randomly  distributed  vertices.  They  do  not 
give  worst  ense  estimates.  It  should  be  noted 
that  in  three  dimensions,  Klee  (20)  shows  that 
the  maximum  number  of  tetrahedra  which  can 
be  generated  from  .V  vertices  is  (){ .Y~‘).  'Huts  an 
optimal  worst  ctuse  complexity  wouhl  be  a  least 
f)(.Y*).  Under  normal  conditions  this  worst  case 
scenario  is  rarely  encountered,  linker  (1(>)  reports 
more  realistic  actual  run  times  for  Watsons  aigu- 
ritlnii. 

3. lib  3- 1)  Kdee  Swanoine  Ahtorithtns 

Until  most  recently,  the  algorithm  of  Green 
and  Sibson  biased  on  edge  swapping  was  thought 
not  to  be  extendable  to  three  dimensions  because 
it  was  unclear  how  to  generalize  the  concept  of 
edge  swapping  to  three  or  move  dimensions.  In 
19S6,  Lawson  published  a  paper  (21)  in  which  lie 
proved  the  fundamental  combinatorial  result: 

Theorem:  (Lawson.  198G)  The  convex  hull  of 
u  +  2  points  in  IV1  can  be  triangulated  in  at  most 
2  ways. 

.loe  (31,32),  and  Hajan  [12)  have  constructed 
algorithms  based  on  this  theorem.  In  joint,  work 
with  A.  Gandhi  (33),  we  independently  constructed 
an  increment <il  Delaunay  triangulation  algorithm 
based  on  Lawson's  theorem.  The  remainder  of 
tins  section  will  review  our  algorithm  and  the  ba¬ 
sic  ideas  behind  3-D  edge  swapping  algorithms. 


It  is  useful  to  develop  a  taxonomy  of  pos¬ 
sible  configurations  addressed  by  Lawson's  theo¬ 
rem.  l-'igure  3.2S  shows  configurations  in  3-D  of 
live  points  which  can  be  triangulated  in  only  one 
way  and  hence  n.»  change  is  possible.  We  call 
these  arrangements  "unswappable  ’.  Figure  3.29 
shows  configurations  which  allow  two  ways  of  tri- 
angulation.  It  is  possible  to  flip  between  the  two 
possible  triangulations  ami  we  call  these  arrange¬ 
ments  “swappable” . 


Figure  3.2ft  Generic  nonswappable  configurations 
of  5  points.  Shaded  region  denotes  planar  surface. 


Figure  3.29  Generic  swappable  configurations  of 
5  points.  Shaded  region  denotes  planar  surface. 

There  are  two  arrangements  that  allow  two 
trimignlatious.  Figures  3.29(a)  and  3.29(h)  show 
the  subclass  of  companion  triungulatioiis  t  hat  can 
be  transformed  from  one  type  to  another  thereby 
changing  the  number  of  tetrahedra  front  2  to  3 
or  vice-versa.  Figures  3.29(c)  and  3. 29(d)  show 
the  other  subclass  of  configurations  that  can  be 
transformed  from  one  type  to  another  while  keep¬ 
ing  constant  the  number  of  tetrahedra  (2),  These 
figures  reveal  an  important  difference  between  the 
two  and  three-dimensional  algorithms.  Thr  num¬ 
ber  of  tetrahedrons  involved  in  the  sivtippmg  op¬ 
eration  need  not  be  constant. 

Tin*  3-D  edge  swapping  algorithm  is  based 
on  Hipping  between  the  two  ways  of  triangulating 
the  configurations  in  figure  3.29.  One  good  way  of 
finding  all  sots  of  five  points  in  the  mesh  is  to  loop 
through  all  the  faces  in  the  mesh  and  considering 
the  five  points  that  make  up  the  two  adjoining 
tetrahedra  for  that.  face,  below  we  present  the 
facewise  edge  swapping  primitive. 

Primitive:  EDGE-SWAP(face) 

Lot.  C  as  {  Set  of  tetrahedra  made  from  the  5 
nodes  of  the  two  adjoining  tetrahedra  } 


If(slin|>i-(C)  —  ooti\vx)lhen 
l.i't  7  =  current  Iriniignlnlioii 
Lrt  T *  —  alternate  triaugulnlioii  (if  it  exists) 
lf(  Qunlity('P)  >  Quality!  7'))then 
[Kdgc  Swap  /  into  /  *j 

Etnlif 
End  if 

Tin1  first  step  is  to  find  all  tin'  Ictnihedra 
that  arc*  described  by  the  five  nodes  <>f  tho  two 
tctralicdra  adjacent  to  tin'  faro  in  question  There 
is  a  maximum  of  four  tclrahedra  that  can  ho  hnilt 
float  five  nodes.  .Since  any  two  tclraliedni  math' 
from  tho  satin'  livo  points  will  lutvo  to  sluiro  three 
points  (i.o..  it  fan')  it  is  sufficient  to  only  look 
at  tho  four  neighboring  tcirnhcdiu  of  any  of  tho 
two  telrnhedin  already  known.  This  const  mites 
a  linear  time  algorithm  for  tinding  till  tho  non- 
nvorlnpping  totrnhcdrn  tnatlo  from  tho  fivo  point  s. 

If  thoso  totrnltotlrn  form  «  convex  sltnpo,  then 
tho  configuration  is  tlcsoribotl  by  otto  of  tho  coniig- 
ttiitiiotis  in  lipcttros  3.28  tun  I  3.29  (configurations  of 
tin-  ooitvt'x  hull)  tuul  edge  swapping  is  permitted. 
If.  for  example,  only  two  of  the  three  totrahodra 
in  figure  3.29(h)  were  present,  the  two  tetrahe- 
<lra  will  form  a  concave  shape.  Obviously,  edge 
swapping  concave  shapes  is  not  possible  without 
possibly  cleat  inn  overlapping  tetrnhodra  in  the 
mesh.  For  swappable  configurations,  a  cheek  is 
performed  to  see  if  the  local  mesh  quality  moa 
sure  (discussed  further  below)  will  improve  by 
edge  swapping  into  the  iilternate  triungulntion.  If 
it  does,  the  swap  is  performed;  otherwise  the  tn- 
augulation  is  unchanged.  This  teehmque  oilers  a 
distinct  advantage  over  others  (Bowyer's  or  Wat¬ 
son's  algorithm)  in  that  it  allows  tin*  use  of  any 
arbitrary  mesh  quality  measure. 

Computational  Asprcts  of  S- I)  Edge  Swapping 

Before  procoding  further,  it  is  useful  to  dis¬ 
cuss  the  eomputat ional  aspects  of  ome  of  t  he  op¬ 
erations  neeiled  for  the  edge  swapping  algorithm. 

♦  Determining  Convexity:  This  operation  tests 
whether  the  shape  formed  by  the  tetrahedron  T\ 
and  T>  is  convex,  bet  the  vertices  of  the  totra¬ 
hodra  l>e  numbered  Tj  =  (1,2.3,  1)  and  T->  — 
(1,2, 3, 5)  i.e.,  (1,2,3)  is  the  face  sharer)  by  T\ 
and  To  and  nodes  -1  and  5  are  at  the  two  ends 
of  T\  and  T,  respectively  We  make  use  of  the 
notion  of  haryeentric  coordinates  to  perform  the 
convexity  test.  The  f>i,2.;i.t  satisfying 


are  called  the  bniycctitl'ie  coordinatea  of  node  5. 
They  indicate  the  position  of  .7  in  relation  to  the 
nodes  of  telrahedioit  7 1 .  k or  each  .■»,  the  sign  of 
It,  indicates  the  position  of  5  relative  to  the  plane 
II,  passing  through  the  triangular  face  opposite 
node  Thus  l>.  -■  0  when  f>  is  in  H, .  h,  >  0  when 
r»  is  on  tin'  same  side  ot  II,  as  node  s.  and  />«  <  D 
when  a  is  on  the  opposite  side  of  //.,  Irutn  node 
>.  Clearlv.  I>\  '» ,i  t  >  0  if  f>  lies  inside  tet rnlii'dron 
\  i .  2, 8,  1 ).  If  we  imagine  a  cone  formed  b\  planes 
II |  ,  of  7|.  then  7|  tuul  '!■}  would  form  a  i  onve.x 

shape  if  and  only  if  node  a  lies  in  the  cone  oil  the 
side  opposite  from  node  l  (ligllte  3.3(1). 


Figure  3.30  Convexity  cone  for  node  1 

The  conditions  to  satisfy  this  requirement  are  /q  < 
t)  and  l>\ ,<„i  ■>  t).  In  order  to  test  if  three  cells  form 
a  convex  shape,  the  nodes  are  renumbered  as  if 
the  three  cell  configuration  were  edge  swapped 
into  the  corresponding  two  cell  configuration  lor 
purposes  of  the  haryeentric  test.  For  example, 
consider  a  three  cell  configuration  with  number¬ 
ing  ( 1, 2,3, 4).  (2,3. 4,3),  and  ( 1, 2, 4, o).  then  the 
corresponding  two  cell  configuration  would  have 

i  \  *1  e  \  .w  I  1\.\  ,'juoti  w\o  f'li'ii  'mil  iiivilim  *)  oriil  .1 
\  l  y  \j  j  no  ttn  mo.  \  «*»»»♦  * *'-•%*'  •*  «.  > 

at  I  lie  ends  ot  the  two  tctralicdra.  Notice  that 
if  the  three  totrahodra  formed  a  convex  shape, 
tin'll  it  would  bo  possible  to  edge  swap  n  to  a 
convex  two  tctralicdra  eouliguratioti  It  the  three 
cells  formed  a  concave  shape,  however,  die  trams 
formed  two  cell  triangulatiou  would  contain  over¬ 
lapping  totrahodra  which  the  test  above  would 
label  as  concave. 

Using  Cramer's  rule  to  solve  the  .-hr  =  /.» 
problem  posed  above  requires  computing  deter¬ 
minants  of  the  form 
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five  times.  An  optimizat  ion  is  possible  by  exploit¬ 
ing  the  property  of  determinants  that  subtract¬ 
ing  one  row  or  column  from  another  leaves  the 
determinant  unchanged.  If  we  subtract  the  iirst 
column  from  the  rest,  we  simplify  t  he  above  4x1 
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determinant  into  a  3x3  one. 
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•  Delaunay  Circumspliere  test:  The  3-D  Delau¬ 
nay  triangulation  is  defined  as  the  unique  trian¬ 
gulation  such  that  the  circumspliere  of  any  tetra¬ 
hedron  contains  no  other  point  in  the  inesh.  To 
determine  where  point  E  lies  in  relation  to  the  cir- 
cumsphere  of  tetrahedron  (A,  B,  C\  D),  denoted 
hy(Q  ABCD),  we  use  the  InSphcre  primitive  : 


(  <  0  if  £  is  inside  O  ABCD 
InSphere(E)  I  =  0  if  E  is  on  O  ABCD 

l  >  0  if  E  is  outside  Q  ABCD 

when  InSphcre  is  computed  from  the  following 
determinant: 

InSphere(E) 
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and  w'j,  =  x2p  +  y),  +  z), 

The  first  determinant  is  the  3-D  extension 
of  Guibas’  InCircie  primitive  [1].  It  represents 
the  volume  of  a  pentatope  whose  vertices  are  the 
points  A.  B,  C,  D,  E  projected  onto  the  -1-D 
paraboloid  ( x 2  +  y2  +  z2).  (A  pentatope  is  the 
simplest  polytope  .n  4-D  just  as  a  tetrahedron  is 
the  simplest  polytope  in  3-D  and  a  triangle  in  2- 
D.  A  pentatope  can  be  constructed  by  joining  the 
tetrahedron  to  a  fifth  point  outside  its  3-space.) 
The  coordinates  in  3-space  of  these  five  points 
remain  unchanged;  they  simply  acquire  a  value 
in  their  fourth  coordinate  equal  to  the  square  of 
their  distances  from  the  origin.  The  volume  of 
this  polytope  is  positive  if  point  E  lies  outside 


O  ABCD  and  negative  if  point  E  lies  inside  O 
ABCD ,  provided  that  tetrahedron  (A,  B,C,  D) 
has  a  positive  volume  (as  given  by  the  second 
determinant).  The  determinant  is  degenerate  if 
point  E  lies  exactly  on  O  ABCD. 

This  test  is  motivated  by  the  observation  in 
3-D  that  the  intersection  of  a  cylinder  and  a  unit 
paraboloid  is  an  ellipse  lying  in  a  plane  (figure 
3.31).  So,  any  four  co-circular  points  in  2-D  will 
project  to  four  co-planar  points  and  the  volume 
of  the  tetrahedra  made  from  these  four  co-planar 
points  will  be  zero.  The  paraboloid  is  a  surface 
that  is  convex  upward  and  the  points  interior  to 
a  circle  get  projected  to  the  paraboloid  below  the 
intersection  plane  and  the  points  exterior  to  it 
get  projected  above  the  the  intersection  plane. 
If  a  point  lies  outside  the  circumcircle  of  three 
other  points,  the  tetrahedron  made  from  these 
four  points  will  have  positive  volume  provided  the 
three  points  were  ordered  in  a  counter-clockwise 
fashion.  The  volume  will  be  negative  if  the  point 
lies  inside  the  circumcircle. 


Figure  3.31  Projection  of  cocircular  points  onto 
unit  paraboloid. 

The  second  determinant  is  the  3-D  exten¬ 
sion  of  Guibas’  CCW  (counter  clock-wise)  prim¬ 
itive  [1]  which  computes  the  volume  of  tetrahe¬ 
dron  ( A,B,C,D ).  Thus  the  InSphere  primitive 
works  irrespective  of  how  the  pointj  A,  B,  C, 
and  D  arc  ordered.  If  it  can  be  guaranteed  that 
all  the  tetrahedra  in  a  mesh  have  their  vertices 
ordered  to  have  positive  volumes  then  the  need 
to  compute  the  second  determinant  is  eliminated. 
The  InSphere  primitive  becomes  ili-bchavcd  when 
the  points  A ,  B ,  C,  and  D  all  lie  nearly  on  a 
plane  because  the  position  of  the  circumspliere 
with  respect  to  the  points  (i.e.,  whether  the  cir- 
cumsphere  is  above  or  below  the  plane)  becomes 
very  sensitive  to  small  perturbations  in  the  coor¬ 
dinates  of  the  five  points. 
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3-D  Mesh  Optimization 

The  3-D  edge  swapping  algorithm  can  be  used 
to  optimize  existing  triangulations.  In  fact,  there 
is  no  way  to  triangulate  a  givea  set  of  points  based 
on  the  minmax  or  maxmin  cf  the  face  angles  di¬ 
rectly.  An  alternative  is  to  start  with  an  existing 
triangulation  and  optimize  it.  This  requires  that 
we  cycle  through  all  the  faces  in  a  mesh  and  apply 
the  edge  swapping  procedure  at  each  step.  This 
process  is  continued  until  no  more  swaps  are  pos¬ 
sible. 

Algorithm:  Three-dimensional  mesh  optimiza¬ 
tion. 

while  (swaps  occurred  in  the  last  cycle  over  faces) 
for  all  faces 
EDGE-SWAP  (face) 
endfor 
endwhile 

In  the  following  paragraphs,  we  discuss  a  few  swap- 
criteria  and  examine  the  meshes  they  produce. 

Global  Edge  Swapping 

The  InSphere  criteria  is  binary  in  the  sense 
that  either  the  triangulation  of  a  set  of  five  points 
satisfies  the  criteria  or  it  docs  not.  It  can  also  be 
shown  that  of  the  two  ways  to  triangulate  a  sot  of 
five  points,  if  one  way  fails  the  InSphere  criteria, 
then  the  other  one  will  pass  and  vice-versa.  Cases 
1  and  3  in  figure  3.15  and  cases  1,  3,  and  5  in 
figure  3.1G  will  always  pass  the  InSphere  criteria. 

The  Delaunay  triangulation  is  unique  for  a 
given  set  of  points.  Lawson  also  noticed  the  re¬ 
lation  between  local  and  global  properties  ol  tire 
Delaunay  InSphere  criteria:  a  triangulation  is  De¬ 
launay  if  and  only  if  the  triangulations  of  sets  of 
five  points  corresponding  to  all  the  interior  faces 
in  the  mesh  satisfy  the  InSphere  criteria.  This 
means  that  if  every  face  satisfies  the  Delaunay 
criteria,  then  the  whole  mesh  must  be  a  Delau¬ 
nay  triangulation. 

Joe  (32)  has  proven,  however,  that  process¬ 
ing  faces  in  an  arbitrary  way  may  result  in  getting 
stuck  in  local  optima.  This  is  an  important  dif¬ 
ference  between  two  and  three  dimensional  com¬ 
binatorial  edge  swapping. 

3-D  MinMax  and  MaxMin  Triangulations 

The  edge  swapping  algorithm  can  lie  applied 
locally  to  produce  a  triangulation  that  minimizes 
the  maximum  face  angle.  In  2-D,  the  edge  swap¬ 
ping  algorithm  (working  with  edge  angles)  gets 
stuck  in  local  minima  and  depending  on  the  or¬ 
der  in  which  the  edges  were  traversed,  different 
local  minima  are  reached,  In  practice,  the  local 


minima  all  seem  very  close  to  the  global  mini¬ 
mum  which  makes  edge  swapping  a  practical  way 
to  get  a  nearly  optimal  MinMax  triangulation. 
We  observe  that  in  3-D  as  well,  there  are  many 
local  minima  and  the  order  of  face  traversal  deter¬ 
mines  which  one  is  found.  It  is  hard  to  determine 
how  far  these  local  minima  are  from  the  global 
minimum  but  we  believe  that  edge  swapping  is 
a  practical  way  to  get  nearly  optimal  MinMax 
meslies. 

Lawson  has  shown  that  in  2-D,  Delaunay  tri¬ 
angulations  have  the  property  that  the  minimum 
edge  angle  is  maximized  (i.e.,  MaxMin  triangu- 
lation).  So  in  2-D,  the  MaxMin  triangulation  is 
unique  and  the  edge  swapping  algorithm  will  con¬ 
verge  to  it.  In  3-D,  however,  the  Delaunay  trian¬ 
gulation  is  not  the  same  as  MaxMin  triangulation 
and  the  edge  swapping  algorithm  working  with 
the  MaxMin  criteria  has  the  same  property  of  get¬ 
ting  stuck  in  local  minima  as  the  MinMax.  Again, 
it  is  hard  to  judge  how  close  the  local  minima  are 
from  the  global  minimum  but  we  still  conclude 
that  edge  swapping  is  a  fairly  efficient  technique 
for  the  construction  of  MaxMin  triangulations. 

3-D  Minimum  Edge  Triangulation 

Another  mesh  of  interest  is  the  minimum 
edge  triangulation.  Since  finite- volume  flow  solvers 
work  edge- wise,  it  is  beneficial  to  reduce  the  num¬ 
ber  of  edges  in  a  mesh.  This  is  easily  accom¬ 
plished  by  edge  swapping  such  that  wc  always 
swap  from  case  3.29a  to  case  3.29b.  Each  time 
this  operation  is  performed,  one  edge  and  one 
tetrahedron  are  removed  from  the  mesh.  Again, 
different  meshes  will  be  produced  depending  upon 
how  faces  are  traversed  and  the  final  mesh  may 
only  be  at  a  loeal  minimum. 

Incremental  Delaunay  Triangulation 

The  edge  swapping  algorithm  provides  an  ef¬ 
fective  way  for  inserting  a  point  into  an  existing 
triangulation.  Simply  find  the  tetrahedra  into 
which  the  point  is  to  be  inserted  and  test  its  faces 
according  to  the  circumsphere  criteria  to  deter¬ 
mine  if  edge  swapping  should  take  place.  If  a 
set  of  5  points  corresponding  to  a  face  is  retri- 
angulated,  we  proceed  to  test  ail  the  outer  faces 
of  the  new  triangulation  for  swappability  and  so 
on.  This  propagates  a  front,  that  retriangulates 
the  mesh.  It  is  known  that  any  new  face  created 
during  the  retriangulation  is  indeed  a  part  of  the 
final  mesh  as  well,  and  so  back-propagation  is  not 
required.  This  may  not  be  true  for  other  mesh 
quality  measures  and  back  propagation  then  be¬ 
comes  necessary. 
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Rajan  proves  that  it  is  possible  to  find  a  cer¬ 
tain  sequence  of  edge  swaps  which  will  guarantee 
that  Delaunay  triangulation  is  recovered  when  a 
site  is  added  to  an  existing  Delaunay  triangula¬ 
tion.  In  practice,  however,  we  find  that  this  order¬ 
ing  of  edge  swaps  does  not  seem  to  be  necessary 
in  order  to  recover  the  Delaunay  triangulation. 
In  fact,  it  is  our  conjecture  that  this  is  always  the 
case. 

This  insertion  algorithm  can  be  used  to  adap¬ 
tively  refine  meshes.  To  do  this,  sites  are  inserted 
at  the  centers  of  the  circumspheres  of  tetrahedra 
with  large  aspect  ratios  (or  other  suitable  mea¬ 
sures).  This  insertion  site  does  not  always  lie 
within  the  cell  T,  marked  for  refinement.  To  find 
the  cell  in  which  the  new  site  lies,  a  walking  al¬ 
gorithm  is  employed.  Starting  at  7),  barycentrics 
are  computed  to  determine  which  face  of  T,  the 
new  site  lies  behind.  The  next  step  is  to  traverse 
to  the  cell  behind  that  face.  This  procedure  is  ap¬ 
plied  recursively  until  the  cell  in  which  the  new 
site  falls  within  is  found.  The  idea  of  introduc¬ 
ing  new  sites  at  the  centers  of  the  circumspheres 
of  tetrahedra  works  well  because  each  new  site 
intre  '  iced  is  equidistant  to  the  4  points  of  the 
large  aspect  ratio  tetrahedra.  This  produces  high 
quality  meshes  in  2-D  and  seems  to  work  well  in 
3-D. 

3.6c  3-D  Surface  Tr  -  gulation 

The  WiltO-  algorithm  has  been  extended 
to  include  the  gulation  of  surface  patches. 

Although  th'  i  v'pt.  of  Dirichlet  tessellation  is 
well  defi""'1  on  a  .  ooch  manifolds  using  the  con¬ 
cept  of  od.'sic  <h  nice,  in  practice  this  is  too 
expense'  Fmdi'.g  geodesic  distance  is  a  varia¬ 
tion  ;  •>  at  is  not  easily  solved.  We  have 

•  ‘if'  .mpler  procedure  in  which  surface 

grids  in  o-iJ  are  constructed  from  rectangular  sur¬ 
face  patches  (assumed  at  least  C°  smooth)  using 
a  generalization  of  the  2-D  Steiner  triangulation 
scheme. 


patch  using  an  adaptive  refinement  strategy  based 
on  absolute  error  and  curvature  measures.  The 
sijrface  patches  are  projected  onto  the  plane,  see 
figure  3.32.  Simple  stretching  of  the  rectangular 
patches  permits  the  user  to  produce  preferentially 
stretched  meshes.  (This  is  useful  near  the  leading 
edge  of  a  wing  for  example.) 

The  triangulation  takes  place  in  the  two  di¬ 
mensional  (s,  ts  plane.  The  triangulation  is  adap¬ 
tively  refined  using  Steiner  point  insertion  to  min¬ 
imize  the  maximum  user  specified  absolute  error 
and  curvature  tolerance  on  each  patch.  The  ab¬ 
solute  error  is  approximated  by  the  perpendicu¬ 
lar  distance  from  the  triangle  centroid  (projected 
back  to  3-space)  to  the  true  surface  as  depicted  in 
figure  3.33.  The  user  can  further  refine  based  on 
triangle  aspect  ratio  in  the  (s,  t)  plane  if  desired. 


Figure  3.33  Calculation  of  triangulation  abso¬ 
lute  error  by  measurement  of  distance  from  face 
centroid  to  true  surface. 


'igure  3.34  shows  a  typical  adaptive  surface  grid 
^iterated  using  the  Steiner  triangulation  method. 


Figure  3.32  Mapping  of  rectangular  patches  on 
(s,  t)  plane. 

Points  are  first  placed  on  the  perimeter  of  each 


Figure  3.34  Adaptive  Steiner  triangulation  of 
surface  mesh  about  Boeing  737  with  flaps  de¬ 
ployed. 
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4.0  Some  Theory  Related  to 
Finite- Volume  Solvers 


For  purposes  of  these  notes,  we  consider  nu¬ 
merical  methods  for  solving  conservation  law  equa¬ 
tions. 


Definition:  A  conservation  law  asserts  that  the 
rate  of  change  of  the  total  amount,  of  a  substance 
with  density  2  in  a  fixed  region  17  is  equal  to  the 
flux  F  of  the  substance  through  the  boundary  <317. 


f  zda+  [  F(z)-ndi  =  0  (integral  form) 
ot  Jci  Jon 

The  choice  of  a  numerical  algorithm  used  to 
solve  a  conservation  law  equation  is  often  influ¬ 
enced  by  the  form  in  which  the  conservation  law  is 
presented.  A  finite-difference  practitioner  would 
apply  the  divergence  theorem  to  the  integral  form 
and  let  the  area  of  17  shrink  to  zero  thus  obtaining 
the  divergence  form  of  the  equation. 

— -z  +  V  •  F(z)  ?=  0  (divergence  form) 

The  finite-element  practitioner  constructs  the  di¬ 
vergence  form  then  multiplies  by  an  arbitrary  test 
function  <j>  and  integrates  by  parts. 


I-  f  <j>z  da—  f  V<j}-F{z)do.+  f  <i>F(z)-n<1 1  =  0 
ot  Jn  Jm 

(weak  form) 

Algorithm  developers  starting  from  these  three 
forms  can  produce  seemingly  different  numerical 
schemes.  In  reality,  the  final  discretizations  are 
usually  very  similar.  Some  differences  do  appear 
in  the  handling  of  boundary  conditions,  solution 
discontinuities,  and  nonlinearities.  When  consid¬ 
ering  flows  with  discontinuities,  the  integral  form 
appears  advantageous  since  conservation  of  fluxes 
comes  for  free  and  the  proper  jump  conditions  arc 
assured.  At  discontinuities,  the  divergence  form 
of  the  equations  implies  satisfaction  in  the  sense 
of  distribution  theory.  Consequently,  at  disconti¬ 
nuities  special  care  is  needed  to  construct  finite 
difference  schemes  which  produce  physically  rel¬ 
evant  solutions.  Because  the  test  functions  have 
compact  support,  the  weak  form  of  the  equations 
also  guarantees  satisfaction  of  the  jump  condi¬ 
tions  over  the  extent  of  the  support.  The  di¬ 
vergence  form  o i  the  equations  is  rarely  used  in 
the  discretization  of  conservation  law  equations 
on  unstructured  meshes  because  of  the  difficulty 
in  ensuring  conservation.  On  the  other  hand,  the 
integral  and  weak  forms  are  both  used  extensively 


in  numerical  modeling  of  conservation  laws  on  un¬ 
structured  meshes.  In  the  next  section,  the  sim¬ 
plest  of  numerical  schemes  based  on  integral  and 
weak  forms  of  the  conservation  law  are  compared 
to  illustrate  their  similarities.  These  schemes  can 
be  viewed  as  the  “central-difference”  counterparts 
on  unstructured  grids.  For  advection  dominated 
flows,  these  algorithms  are  inadequate  and  addi¬ 
tional  terms  must  be  added.  This  topic  is  under¬ 
taken  in  detail  in  future  sections. 

4.2  Comparison  of  Finite- Volume  and  Galcrkin 
Finite-Element.  Methods 

Although  the  integral  and  weak  forms  of  the 
equations  appear  to  be  quite  different,  numerical 
schemes  based  on  these  forms  often  produce  iden¬ 
tical  discretizations.  To  demonstrate  tins  point, 
consider  the  Galcrkin  discretization  (with  linear 
elements)  of  a  general  model  advection-diffusion 
equation  (/<  >  0): 

^  +  V.F(:)  =  V-,iVJ 

Multiplying  by  a  test  function  <!>  and  integrating 
by  parts  over  the  region  Q  produces  the  weak 
form  of  the  equation. 

-f-  \  (j)z  da-  f  ■  F {z)da-\-  [  <p  F(c)  •  n  dl 
ot  Jn  Jn  Jon 

-  -  —  f  /<V  <f>  •  V  z  d  a  +  I  fi<i>Vz-ndl 
Jn  Jvn 

(4.0) 

In  the  finite-element  method,  the  entire  domain 
is  first,  divided  into  smaller  elements.  In  this  case, 
the  elements  are  triangles  Tj,  such  that  $2  —  U Tj, 
Tj  fl  Ti  —  ill,  l  -■£  j.  In  Fig.  4.1a  wc  show  a  rep¬ 
resentative  vertex  with  adjacent  neighbors.  (To 
simplify  the  discussion  in  the  remainder  of  these 
notes,  we  adopt  the  convention  that  the  index 
“j”  refers  to  a  global  index  of  a  mesh  whereas 
the  index  “t”  always  refers  to  a  local  index.)  The 
linear  variation  of  the  solution  in  each  triangle  Tj 
can  be  expressed  in  terms  of  the  three  local  nodal 
values  of  the  solution,  z§.  i  —  1,2,3,  and  three 
element  shape  functions  n ; ,  i  =  1,2,3. 

3 

y)r,  =  53 ».■(*,#)  4,,i 
1=1 

(local  representation) 
Each  clement  shape  function  n,  can  be  interpreted 
as  a  piecewise  linear  surface  which  takes  on  a  unit 
value  at  u,  and  vanishes  at  the  other  two  vertices 
of  the  triangle  as  well  as  everywhere  outside  the 
triangle.  The  solution  can  also  be  expressed  glob¬ 
ally  in  terms  of  nodal  values  of  the  solution  and 


t 


f 

t 
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global  shape  functions. 

nodes 

(global  representation) 
In  this  form,  the  global  shape  functions  are  piece- 
wise  linear  pyramids  which  are  formed  from  the 
union  of  all  local  shape  functions  with  have  unit 
value  at  vj .  These  global  shape  functions  also  en¬ 
joy  compact  support,  i.e.  they  vanish  ontside  the 
region  Q,  formed  from  the  union  of  all  triangles 
incident  to  vj .  A  global  shape  function  for  vertex 
t<j  is  shown  in  Fig.  4.1b. 


Mesh 

Median  Dual 
Centroid  Dual 


1 

Figure  4.1a  Local  mesh  with  centroid  and  me¬ 
dian  duals. 


4 


Figure  4.1b  Global  shape  function  for  vertex  v0 
(not  labeled). 

The  Galcrkin  finite-element  method  assumes 
that  the  class  of  test  functions  is  identical  to  the 
class  of  functions  approximating  the  solution.  The 
simplest  test  functions  of  this  sort  are  the  in¬ 
dividual  shape  functions.  To  obtain  a  Galerkin 
discretization  for  a  typical  vertex  vj,  simply  set 
<j>h  =  Nj  and  evaluate  (4.0)  in  Qj.  Since  4>  van¬ 
ishes  on  dilj  equation  (4.0)  simplifies  to  the  fol¬ 
lowing  form: 


•F(zh)  da 


■Vzh  da 


Before  evaluating  equation  (4.1),  it  is  useful  to 
introduce  more  notation  concerning  the  geometry 
of  figure  4.1a.  Figure  4.2  depicts  the  index  and 
normal  convention  which  will  be  used  throughout 
these  notes.  The  triangle  with  vertices  0,  i,  and 
f  +  1  is  denoted  as  T1+ x  /<_,  This  index  convention 
will  be  used  for  other  quantities  such  as  areas  and 
gradients  which  are  computed  in  Ti+1/2. 


Figure  4.2  Vertex  v0  and  adjacent  neighbors. 

It  is  convenient  to  define  normals,  h,  for  straight 
edges  which  are  scaled  by  the  length  of  the  edge. 
Using  this  notation,  a  simple  formula  exists  for 
the  gradient  of  the  numerical  solution  in  a  triangle 
Ti+ 1/2- 

VS-‘+l/2  =  -  (*0*h-M/2  +  -  S.Vrii) 

^i+1/2 

(4.2) 

The  gradient  of  the  test  function  in  each  trian¬ 
gle  takes  a  similar  form  (replace  z  by  <f>  in  the 
previous  formula  with  ~  =  0). 

^H-l/2  =  7T7 - n.+  l/2  (4.3) 

"*“i+ 1  /2 

The  discrete  form  of  eqn.  (4.0)  is  now  written  as 


(4.4) 

The  flux  integral  can  be  evaluated  by  exact  inte¬ 
gration  (when  possible)  or  numerical  quadrature. 
In  this  case,  the  latter  is  assumed. 

/ F (zh)da  =  (F (4)  +  F(zj)  +  F (z^)) 

(4-5) 

The  diffusion  term  is  also  evaluated  with  Vzu 
constant  in  T1+1/2  and  /q+i/2  the  area  weighted 
average  /». 

/  da  =  Ai+1/2  ^i/2^zi+i/2  (4.6) 

JT>+t/ 1 


i 


(4.1) 


ft-.'l 


This  simplifies  (4.0)  considerably. 

I  if  da 

rf(fo)  , 

+  E  §*+»/*  ■  (F(*c)  +  F(*f)  +  F(*?+1)) 
1=1 
<H”o) 

-  E  o^'+l/Z  “i+1/2  ■  Vi+l/22,‘ 

i=l 

(4.7) 

Equation  (4.7)  represents  a  Galerkin  discretiza¬ 
tion  of  the  model  equation  assuming  piecewise 
linear  functions.  Note  that  as  far  as  the  geom¬ 
etry  is  concerned,  only  the  exterior  normals  of 
Hj  appear.  Conspicuously  absent  are  the  normal 
vectors  for  interior  edges.  This  strengthens  our 
confidence  that  we  can  show  an  equivalence  with 
a  finite-volume  discretization  on  nonoverlapping 
control  volumes.  To  show  this  equivalence,  note 
that  the  flux  term  can  be  manipulated  using  the 

identity  ]C?al°^«+i/2  =  0  ‘nto  a  *orm  *n  which 
the  relevant  geometry  is  any  path  connecting  ad¬ 
jacent  triangle  centroids  (R  denotes  the  spatial 
position  vector): 


<K  "o ) . 

E  •  (F(~o)  +  F (zf)  +  p {*?„)) 

»=1 

d(vo)  - 

-Eg  (p(*»)  +  F(2f))  ■  (*+1/2  +  fii-1/2) 

l  —  1 


=  E  s(F<*ofc)  +  F(*?))-  /  « 


dl 


1*1 

d(vo) 


^R-I 


V  v  o  /  *  * 

=  E  5  (F<4) +*<*?»•  / 

.=1  Z 


i(Ro  +  R,  +  Ri  +  1) 

n  dl 

j(Ro+R,+Ri-i ) 


(4.8) 

The  diffusion  term  also  simplifies  using  this  iden¬ 
tity. 


li(vo) 

E 


i=i 


1  » 

2n<+1/2 


h 


</(«■») 

=  E  *+  l/2^7«'+l/2^h  • 
1=1 


L 


J(Ro+Ri+i ) 


n  <i/ 


i(R«+R0 


(4.9) 

To  obtain  a  single  consistent  path  for  the  inte¬ 
grations  appealing  in  equations(4.8)  and  (4.9)  re¬ 
quires  that  the  path  pass  through  the  centroid 
of  each  triangle  and  the  mid-side  of  each  inte¬ 
rior  edge.  The  path  formed  by  connecting  these 
points  by  line  segments  is  precisely  the  median 
dual  of  the  mesh.  This  dual  completely  covers 


the  domain  (no  holes)  and  represents  a  consis¬ 
tent  and  conservative  finite- volume  discretizat  ion 
of  the  domain  which  is  spatially  equivalent  to  the 
Galerkin  approximation.  The  scheme  can  now  be 
written  in  a  finite-volume  form 

<*(*» O) 

~<f>hzh  da  +  £(H  -»)i  =  0  (4.10) 

1=1 

where  H  is  the  numerical  flux  of  the  finite- volume 
discretization 


(H  •  tx)i  =i(F (2*)  +  F \zt))  •  /R,+  n  dl 

.  rR,r 

-  /  n  dl 

.  [K+w 

-  «.+i/2V«-ti/2^  •  /  n  dl 

J  R"‘ 

(4.11) 

and  R;+1/->  is  the  centroid  of  T1+1/2,  R'+ ,  y.>  = 

^(Ro  +  R;  +  R;+i)  and  R"1  is  the  midpoint  of 
the  edge  e(v0,  t>,),  R”‘  =  |(R.0  +  R,). 


Conclusion:  The  spatial  discretization  produced 
by  the  Galerkin  finite-element  scheme  with  lin¬ 
ear  elements  has  an  equivalent  finite-volume  dis¬ 
cretization  on  nonoverlapping  control  volumes  wiih 
bounding  curves  which  pass  through  the  centroid 
of  triangles  and  midside  of  edges.  One  such  set 
of  control  volumes  satisfying  these  constraints  is 
the  median  dual. 

We  now  need  to  ask  if  the  time  integrals  pro¬ 
duce  identical  “mass”  matrices  for  the  Galerkin 
finite-element  and  finite-volume  schemes.  The 
answer  to  this  question  is  no.  In  fact,  these  ma¬ 
trices  are  not  the  same  in  one  space  dimension. 
The  Galerkin  mass  matrix  for  a  simple  1-D  mesh 
with  uniform  spacing  produces  a  row  of  the  mass 
matrix  with  the  following  weights: 

1  /n(  +4*'  +  '"l) 

(Finite  -  Element) 
Tiie  finite  volume  scheme  on  “median”  dual  pro¬ 
duces  the  following  -weights: 

(Finite  -  Volume) 
Although  the  finite-volume  matrix  gives  better 
temporal  stability,  the  finite-element  mass  matrix 
is  more  accurate. 
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4.3  Bdtre  Formulas 

The  first,  tenn  appearing  on  the  rigut-hand- 
sidc  of  equation  (4.10) 


-(F(4‘)  +  f(4„ 


,  i/a 

I  ndl 


suggests  a  computer  implementation  using  an  edge 
data  structure.  The  fluxes  in  this  term  are  eval¬ 
uated  at  the  two  endpoints  of  an  edge.  The  geo¬ 
metrical  terms  could  be  evaluated  edgewise  if  the 
midpoint  of  the  edge  and  the  centroids  of  the  two 
adjacent  cells  are  known.  Recall  that  the  edge 
data  structure  (described  in  section  1.3)  for  a  2- 
I)  mesh  supplies  this  information  for  each  interior 
edge  of  the  tnesli,  i.e.  the  structure  provides  for 
each  edge 

(1)  The  two  vertices  which  form  the  edge. 

(2)  The  two  adjacent  cell  centroids  (or  a  pointer 
to  centroid  values)  which  share  the  edge. 

More  generally,  if  the  solution  is  assumed  to  vary 
linearly  within  each  triangle  then  edge  formulas 
can  he  derived  for  discretized  forms  of  the  gra¬ 
dient.,  divergence,  Hessian,  and  Laplacian  oper¬ 
ators.  As  we  will  see,  the  formulas  can  be  de¬ 
rived  from  either  a  finite-volume  or  finite-element, 
point-of-view  with  essentially  identical  results. 

4.3a  Gradient  and  Divergence  Edge  Formulas 

As  a  first  example,  we  will  derive  an  edge 
formula  for  the  integral  averaged  gradient  of  a 
function  it,  f  Vit  da,  for  the  the  region  de¬ 
scribed  by  the  union  of  all  triangles  which  share 
the  vertex  t>o,  see  figure  4.1a.  If  the  discrete  solu¬ 
tion  u"  varies  linearly  in  each  triangle  T  then  the 
gradient  is  constant  and  the  integration  exact. 


da=  £  (W-Mr 
TCfto 


(4.12) 


Identical  results  tire  obtained  by  approximating 
the  riglit-hand-side  of  (4.13)  by  trapezoidal  quadra¬ 
ture  (exact  for  piecewise  linear  uh) 

<f  uk  n  d\  i(u?  +  «!*+1)iii+1/2  (4.14) 

Josio  l€jo  ^ 

where  To  =  {1,2,  ...,6}  and  fij+1/2  is  the  vector 
perpendicular  to  the  edge  I'i+i)  with  magni¬ 
tude  equal  to  the  length  of  the  edge.  The  sum¬ 
mation  can  be  rearranged  to  yield 


4  constant  solution  can  be  added  to  (4.15)  since 
the  gradient  of  a  constant  function  is  exactly  zero 
in  this  discretization.  In  particular,  we  add  the 
value  of  uh  at  vertex  «q. 

«'*  ndl  =  £  i(u£  +  u?)(n,+1/2  -f  *Si_i/2) 

>€lo 

(4.15) 

Once  again  using  the  fact  that  for  any  closed 
curve  / 11  dl  =  §d n  =  0  which  implies  that 


fii+t/2  +  “1-1/2 


for  any  path  connecting  «,_i  and  vi+l,  This  path 
integral  represents  a  vector  which  is  parallel  in  di¬ 
rection  and  three  times  the  magnitude  of  the  vec¬ 
tor  n  obtained  by  computing  the  integral  for  any 
simple  path  connecting  the  centroids  of  the  two 
triangles  which  share  the  edge  e(t;0,  «,•).  d  n  = 

3/J ’’  dii  =  3noi-  This  reduces  the  gradient  for¬ 
mula  to  the  following  form: 

uh  n  dl  =  ^(«„  -f  u-Jnn,-  (4.17) 

Jodo  2 


The  vertex  lumped  average  gradient  at  vertex  is 
then  given  by 


Equation  (4.12)  would  suggest  computing  the  gra¬ 
dient  in  each  triangle  sharing  v<j  and  accumulat¬ 
ing  the  area  weighted  sum.  If  integral  averaged 
gradients  arc  required  at  all  vertices  then  the  gra¬ 
dient  in  each  triangle  could  be  competed  and  the 
area  weighted  result,  scattered  to  the  three  ver 
ticcs  of  the  triangle  for  accumulation.  We  re¬ 
fer  to  this  as  the  element-by-element  approach. 
A  Green’s  formula  would  suggest  a  different  ap¬ 
proach  for  the  same  task. 


=  (4.18) 

An°,7?,2 

It  is  well  known  that  the  region  bounded  by  the 
“median”  dual  at  vertex  no,  (shown  in  figure  4.1a) 
has  an  area  Aq  which  is  exactly  jAq0.  Therefore, 
using  the  median  dual  we  obtain  a  formula  which 
appears  to  represent  some  approximate  quadra¬ 
ture  of  the  right-hand-side  of  (4.13)  on  nonover¬ 
lapping  regions. 


Vn  da  = 


f  11  n  <1 1 
Jon  0 


(4.13) 


(Vu'>)Uo  =  i-^^uo+M?)fio- 

0  0 


(4.19) 


fi-33 


A  naive  interpretation  of  equation  (4.19)  would 
probably  conclude  that  this  equation  is  a  rather 
poor  approximation  to  (4.13).  It  is  not  obvious 
from  (4.19)  that  the  gradient  of  a  linear  func¬ 
tion  u  is  computed  exactly.  FYom  the  origin  of 
this  formula,  we  now  know  that  this  formula  can 
be  obtained  from  a  trapezoidal  quadrature  on  a 
slightly  larger  region  and  is  exact  within  the  class 
of  linear  polynomials. 

Keep  in  mind  that  a  constant  solution  could 
have  been  subtracted  instead  of  added  from  equa¬ 
tion  (4.15)  which  would  have  given  a  different  but 
equivalent  form  of  (4.19). 


This  formula  does  not  appear  to  resemble  any 
approximate  quadrature  of  (4.13). 

Equation  (4.19)  suggests  an  algorithm  using 
an  edge  data  structure  which  is  quite  different, 
from  the  element-by -element  method  (4.12).  The 
edge  based  calculation  consists  of  the  following 
steps- 

Sample  Gradient  Computation 

(1)  (Precomputation)  For  each  edge  <?(?>,,  vj)  gather 
the  centroid  coordinates  of  the  t\  o  adjacent  cells, 
v[  and  Vj. 

(2)  (Precomputation)  For  each  edge  compute  the 
dual  edge  normal  n,j  from  the  centroid  coordi¬ 
nates  n,j  =  fj  d  it.  (Orient  from  »»f  to  vj  it 
'<./)• 

(3)  For  each  edge  <•(<>,,  Vj)  gather  the  values  of  the 
function  at  the  two  vertices,  u  •'  and  ufj . 

(4)  For  each  edge  compute  the  arithmetic  average 
ami  multiply  by  the  dual  edge  normal,  i(»i;  -f 

(5)  For  each  edge  scatter  and  accumulate  the  re¬ 
sult  at  vertex 

(6)  For  each  edge  negate,  scatter  and  accumulate 
the  same  result  at  vertex  Vj. 

(7)  For  each  vertex  compute  the  final  gradient,  by 
dividing  the  accumulated  result  by  area  of  the 
median  dual,  A0. 

This  algorithm  conforms  perfectly  within  the  edge 
data  structure.  In  practice  all  the  geometrical  fac¬ 
tors  could  be  precomputed  and  stored  in  memory 
by  edge  thereby  eliminating  a  gather.  The  sample 
algorithm  described  above  serves  as  a  template 
for  all  the  remaining  algorithms  described  in  the 
rest  of  this  section. 


The  gradient  and  divergence  operators  are 
related  so  that  it  is  not  surprising  that  the  dis¬ 
cretization  of  the  divergence  operator  produces  a 
similar  formula: 

div(Fh)do  =  £  |(F*  +  F?)  •  » o,  (4.21) 
<€7o 


The  Galcrkin  weighted  finite  element  integrals  with 
linear  elements  produce  essentially  identical  re¬ 
sults.  In  this  case  a  piecewise  linear  weighting 
function  is  introduced  (see  figure  4.1b).  The 
gradient  and  divergence  formulas  (introduced  ear¬ 
lier) 


4>h  Vuhda 


=  ]C  5(«o 


(4.22) 


differ  from  *he  previous  formulas  by  a  constant 
factor  of  1/3.  For  example,  if  a  lumped  approxi¬ 
mation  to  the  left-hand-side  of  (4.22)  is  assumed, 
then  (4.19)  is  recovered  since 


chhSrv.hdc  «  (Vxth)Vt,A0. 


(4.24) 


4.3b  2-D  Hessian  and  Laplacian  Edge  Formulas 

We  begin  by  approximating  the  following  ma¬ 
trix  of  second  derivatives 


V/t(Vu)T 


(/U<j-)j-  (/*U,,)j, 


(4.25) 


using  a  standard  Galcrkin  approximation  for  the 
region  J70  formed  from  the  union  of  all  triangles 
that  share  the  vertex  u<j-  To  do  so,  multiply  (4.25) 
by  the  weight  function  <j>  and  perform  integration 
by  parts  over  Q0  assuming  </>=•=  0  on  dffo- 


(4.26) 

where  Tj+1/2  =  simplex  (t'o,  v„  vi+l).  Using  the 
notation  of  figure  4.2,  gradients  of  the  piecewise 
linear  functions  <j>h  (figure  4.1b)  and  uh  are 


(W'-)/Ul„  =  - 


2Aj+i/2 


n;+i/2 


(4.27) 
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(V«,‘)/Wl/J  =  — - («o“i+I/2+«*?fii+l-«i+lfi.-) 

*^1+1/2 

(4.28) 

where  /ll+l/2  is  the  area  of  7’,+ 1/2  and  «,+1/2 
is  the  vector  normal  to  the  edge  e(t>i,  o,+i)  with 
magnitude  equal  to  the  length  of  the  edge. 

For  piecewise  linear  uh,  the  gradient  is  con¬ 
stant  in  each  triangle.  The  integral  average  ma¬ 
trix  of  second  derivatives  simplifies  to  the  follow¬ 
ing  form: 

[<f>hV  n(Vuh)r  da 
■h0 

- - «•'+'/-'  /  t<Vu,‘)’rda 

fclAi+i,,  J Ti+1/a 

=E  „Jf>* 

•€lo  +l 11 

,r7  ..h\T 

(GTo 

(4.29) 

where  /Il+1/,2 *s  the  integral  average  of  //  in  T,+  1/2 
Inserting  the  triangle  gradient  formula,  we  obtain 
a  discretized  formula  for  the  Galerkin  integral. 

/  /V/i(Vu*)7'  ci« 

Jn0 

- J  L  1  ,  ».+  l/2(«0>‘,+  I/2 

4  -li+2/2 


+  «?»*/+ 1  - 


(4.30) 

Regrouping  terms  and  removal  of  a  constant  so¬ 
lution  yields  the  following  simplified  form 

f  <l>hV  fi{Vnh)r  da  f  V/i(V(«fc  -  «J))T  da 

‘'Oo 


=  £  Mi ( .if  -  «$) 

<G7(i 


(4.31) 


M  —  —  -  —  '  *>i+ 1  /2  ( 1 ) 

4  L'4*+ 1/2 


Ml  — 1/2 

-  - - ni_i/2(n,-.|) 

v*«— 1/2 


.  (4.32) 


Even  though  this  formula  is  very  simple,  it  is 
not  compatible  with  the  edge  data  structure  men¬ 
tioned  earlier.  Using  some  simple  identities,  we 
will  now  rewrite  the  weight  formula  in  a  form 
which  is  compatible  with  the  edge  data  structure. 
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Figure  4.3  Local  geometry  configuration. 

Refer ring  to  figure  4.3,  we  have  the  following  vec¬ 
tor  identities: 

1  -  -  „ ..  1  _ 
n,_i/2  =  3n r(  -  -Hi,  n,_i  =  3n«,  -f  -n, 


and  similarly 


(4.33) 


di+1/2  —  3“r„  +  2**''  n<+i  =  +  ^n,-. 

(4.34) 

It  is  useful  to  decompose  the  tensor  product,  terms 
into  symmetric  and  skew-symmetric  parts,  for  ex¬ 
ample: 

-n,_,/2(»ii-i)T=  (-ii iHj  -  9ii/jii]j) 

N  -  v  ' 

Symmetric 

+  ^ («.»«-  nitnf) 

< - - - ' 

vie 


».  M/2(n.+, )r  =  ( 9i»/.n'[) 

v- - - - - - ' 

Symmetric 


+  o (“•»/.-  ni.ni ) 


Skvw— symmetric 


Upon  dividing  by  the  area  terms,  some  simple 
algebra  reveals  that 


»i-i/2(n.-i)7’  _  (Ou/i.Hff,  -  |ii tu[)  +  T  ()  1 

Ai-1/2  -4,' -i/2  .-1  0 

(4.35) 

and  similarly 


n.+  i/2(ni+i)-  _  (ffin/  -  |  0  1 

Ai+1/2  -4l+i/>  -1  0 

(4.36) 
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So  in  summary  wo  have  that 


-I 


~;r~~  “•+ !  n  (***+ 1  )T 

"i+l/2 


Mi— 1/2  -  ,  x7‘ 

-- — — ni-j/a^n.i)' 
'1.  —  1/-2 


-  or>/„n[, 


+/‘i-l/L> 


d,-f  1  /2 

i»iW?  - 


A- 1/ 

"KMi+l/2  ~  Mi-1/2) 


■  0 

ll 

-1 

0 

J  J 

.  (4.37) 


The  second  form  is  compatible  with  an  edge  data 
structure  where  edge  vertices  and  adjacent  cell 
centroids  are  known. 

4.3c  2-D  Ed  ire  Discretisation  of  V  ■  uVu 

Calculation  of  this  term  amounts  to  sum¬ 
ming  diagonal  entries  of  the  previous  result.  Can¬ 
cellation  of  terms  leaves  a  reduced  form. 


/  -A'*  V  •  /iVu'1  da  —Trace  /  4>h  V  n(Vuh)r  da 


Y w(«?  -  «o) 

te/o 


IV  --Ifv  gi-H/3  1 
w*  4  r+|/2  >t  j .  m 

.  _  “i— 1/2  '  *‘i-t  1 

H' J 


(4.38) 

(4.39) 


The  area  of  a  triangle  can  be  expressed  in  terms  of 
the  magnitude  of  the  cross  product  of  the  sealed 
edge  normals. 


-^ift/2  =  2l“'+lf2  *  “H  i I 
•di-l/2  =  J|n.-l/2  X  »Vl| 


W,  = 


1  |_  (“i-f  1/2  •  >*i-h) 

2  r+‘/2F«i+t/2xnj+1| 


”  Mi- 1/2 


(n,-l/2  -n,-i) 
|n,_i/2  x 


(4.40) 


Fiually  we  can  express  the  dot  amt  cross  products 
in  terms  of  the  local  angles  as  sketched  in  figure 


4.4. 


Figure  4.4  Local  angles  for  triangles  sharing  edge 


nH-l/2  ‘  fij+i 

|n.-+i/2  x  iii+il 


and 


cosjo^J 
sin  (07, ,) 


-cotan(a/,, ) 
(4.41) 


“i-1/2  ~  iij— 1 

|>Vl/2  X  lli_i  | 


cos  (an,) 

sin  (oh,) 


-cotan((.VH . ) 


(4.42) 

Inserting  these  formulas  yields  a  particularly  sim¬ 
ple  form  of  the  weight  factors  IF(: 


M,+ i/2l'Otan(o/„)  -|-  7*i_x /2cotivn(rt )] 

(4.43) 

Equation  (4.43)  is  particularly  useful  in  theoreti¬ 
cal  studios. 

4.3d  3-D  Hessian  and  Laplaeian  Edge  Formulas 

As  in  the  2-D  case,  we  begin  with  the  Galcrkin 
integral  equation  for  the  Hessian-like  matrix  of 
derivatives. 

I  <AhV  ft(Vuh)T dv  =  -j(W)  (Vuh)rdv 

In  this  formula  is  the  volume  formed  by  the 
union  of  all  tetraheura  that,  share  vertex  wq.  Fol¬ 
lowing  a  procedure  identical  to  the  2-D  case,  we 
can  derive  the  analogous  3-D  edge  formula  for  the 
matrix  of  second  derivatives 


/  <AhV  /<(Vuh)r  dv  =  r  Mj(iti  -  «o)  (4.44) 
Jvx'» 


where 


«t(t>o,V,) 

M  =  -5  E  "" 


5  h 


45) 


i 


i 


5 


i 


10  is  thi'  si't  of  indices  of  all  adjacent.  neighbors  of 
t>o  connected  by  incident,  edges,  A*  a  local  cyclic  in¬ 
dex  describing  the  associated  vertices  which  form 
a  polygon  of  degree  d( t»o,  wj)  surrounding  the  edge 
<>(d0,  d{).  The  subscript  A  +  1/2  indicates  quanti¬ 
ties  associated  with  the  tetrahedron  with  vertices 
Co,  o,,  dr  and  o/t+i  as  shown  in  figure  '1.5. 


i 


Figure  4.5  Sot  of  totrahedra  sharing  edge  e{  i>0,  v, ) 
with  local  cyclic  index  k. 

4.3e  3-D  Edge  Discretization  of  V  i>Vu 

Following  the  same  procedure  as  in  2-D,  we 
obtain: 


f  /V  -pVn*  dv  =  Trace  f  4>h'V nk)T dv 

vl'o  »'l'o 

=  ^  ~  W«) 


where 


(4,16) 


<l(  I'o.t'i)  - 

11/  1  "fc+l/2?  -*  .  I 

W'~  9  zL  Vfc+l/,  A+,/2  'sH  i/2  (4-‘17) 

It  can  be  shown  that  the  volume  of  a  tetrahedron 
is  given  by 


*+1/2  “  3  |ARfc+1/2| 


(4.48) 


where  is  the  length  of  the  edge  shared 

by  the  faces  associated  with  s\.+ ,  y2  and  s^.+  ,y.r 


,  J(*-0 ,t»»>  ^  .  .S’ 

°  )t=l  !s*.+  l/2XSfc+l/2| 

(4.49) 


Finally  we  can  rewrite  the  dot.  and  cross  product 
in  terms  of  the  cotangent  of  the  face  angle. 

4-fl/2  ‘4+1/*  __  _  <-Os(o^.H/2) 

14+ 1  /2  xirM  ,  n  I  sin  («++!  /2 ) 

=  -  colan(«r+i/2) 

As  in  the  2-D  case,  the  weights  IF,  now  have  a 
particularly  simple  form: 

^  d(vt.Vi) 

~  ^  Y  4+ 1  /•.>  I A  74+ 1  /■_»  |cotau(rt*.+ 1  /■> ) 

fciS  I 

(4.50) 

4.4  Godunov  Finite- Volume  Schemes 

In  this  section,  we  consider  upwind  algorithms 
for  scalar  liyperbolic  equations.  In  particular, 
we  concentrate,  on  upwind  schemes  based  on  Go¬ 
dunov’s  method  [34]  and  defer  the  discussion  of 
“upwind’*  schemes  based  on  the  fluctuation  de¬ 
composition  method  or  the  Petrov-Galcrkin  for¬ 
mulation  (SUPG,  GLS)  to  the  lectures  of  Profs. 
Deeoninek,  Hughes,  and  Johnson. 

The  development  presented  here  follows  many 
of  the  ideas  developed  previously  for  structured 
meshes.  For  example,  in  the  extension  of  Go¬ 
dunov’s  scheme  to  second  order  accuracy  in  one 
space  dimension,  van  Leer  [35]  developed  an  ad- 
veetion  scheme  based  on  the  •  i-  Y  o  tum  of  dis¬ 
continuous  piecewise  linear  c.  ‘  * o •, * i. >■•«?  together 
with  Lagrangian  hydrodynm.  ,  Vu>n  thereafter, 
Colclla  and  Woodward  [.  dward  and 

Colella  [37]  further  extern,  c  ideas  to  in¬ 
clude  discontinuous  piecewi:  o  jarabolic  approx¬ 
imations  with  Eulerian  or  Lagrangian  hydrody¬ 
namics.  Harten  et.  al.  [38,39]  later  extended 
related  schemes  to  arbitrary  order  and  clarified 
the  entire  process.  Those  techniques  have  been 
applied  to  structured  meshes  in  multiple  space  di¬ 
mensions  by  applying  one-dimensional-like  s<  homes 
along  individual  coordinate  lines.  This  has  proven 
to  be  a  highly  successful  approximation  but  does 
not.  directly  extend  to  unstructured  meshes.  In 
reference  (40),  we  proposed  a  scheme  for  multi¬ 
dimensional  reconstruction  on  unstructured  meshes 
using  discontinuous  piecewise  linear  distributions 
of  the  solution  in  each  control  volume.  Mono- 
tonicity  of  the  reconstruction  was  enforced  using 
a  limiting  procedure  similar  to  that,  proposed  by 
van  Leer  [35]  for  structured  grids.  In  a  later  pa¬ 
per  (Barth  and  Frcdcrickson  [41]),  we  developed 
numerical  schemes  for  unstructured  meshes  utiliz¬ 
ing  a  reconstruction  algorithm  of  arbitrary  oili  n'. 
Portions  of  the  discussion  presented  here  is  taken 
from  these  papers. 


4,4a  Generali/, c  l  jSfflduuoy  Scheme 

Wo  begin  by  considering  (.lie  integral  con¬ 
servation  law  for  some  domain,  0  and  its  tes¬ 
sellation  T({ I)  comprised  of  cells,  Cj,  Cl  —  Uc'j, 
sa  t),  k  ^  j.  The  integral  equation  is  valid 
for  the  entire  domain  Q  as  well  as  in  each  cell  (or 
possibly  dual  cell): 

—  /  ti <la  [  P(u)  •  n<//  =  0  (4.5lo) 

Jc}  JOci 

Fundamental  to  Godunov's  method  is  the  cell  av¬ 
erage  of  the  solution,  TI,  in  each  cell. 

/  t<  da  -  UjAj  (4.52) 

,/oj 

In  Godunov’s  method  and  the  higher  order  ac¬ 
curate  extension  consul  red  here,  these  cell  av¬ 
erages  are  treated  as  the  fundamental  unknowns 
(degrees  of  freedom). 

l  F(ti)  •  mil  =  0  (1.516) 

<■"  JftCj 

The  solution  algorithm  for  (4.51l>)  is  a  relatively 
standard  procedure  for  extensions  of  Godunov's 
scheme  in  Eulorian  coordinates  [34-39].  The  ba¬ 
sic  idea  is  to  start  with  piecewise  constant  data  in 
each  cell  with  value  equal  to  the  integral  t  ell  aver¬ 
age,  Using  information  from  cell  averages,  k  -  th 
order  piecewise  polynomials  are  reconstructed: 

u  (■*  i  ll)  ~  ^  (>n,it)(-*  •*  <  ■■ !/  !/<■) 

»»»+«<& 

(4.53) 

where  P(„Kn)(x  -  .r,.,  y  -  ;/t.)  =  (.r  - .r,.)m(j/  -  y,.)" 
and  (,i>,  »/,.)  is  the  cell  centroid.  The  process  of 
reconstruction  amounts  to  finding  the  polynomial 
eoellicients,  Near  steep  gradients  and  dis¬ 

continuities,  those  polynomial  eocfiicionts  maybe 
nltered  based  on  monotonieity  arguments.  Be 
cause  the  reconstructed  polynomials  vary  discon- 
tinuously  from  cell  to  cell,  a  unique  value  of  the 
solution  does  not.  exist,  at  cell  interfaces.  This 
nonuniqueness  is  resolved  via  exact  or  approxi¬ 
mate  solutions  of  the  Riemann  problem.  In  prac¬ 
tice,  tins  is  accomplished  by  supplanting  the  true 
(lux  function  in  (4.51)  with  a  numerical  flux  func¬ 
tion  (described  below)  which  produces  a  single 
unique  flux  given  two  solution  stales.  Once  the 
flux  integral  in  (4.51)  is  carried  out  (either  ex¬ 
actly  or  by  numerical  quadrature),  the  cell  aver¬ 
age  of  the  solution  can  be  evolved  in  time.  In 
most  cases,  standard  techniques  for  integrating 


ODE  equations  are  used  for  (lie  time  evolution, 
i.e.  Euler  implicit,  Euler  explicit,  Runge-Kutta. 
The  result  of  the  evolution  process  is  a  new  col¬ 
lection  of  cell  averages,  The  process  can  t  hen  lie 
repeated.  The  process  can  be  summafVM'd  in  the 
following  steps: 

(1)  Reconstruction  in  Each  Cell:  Given  inte¬ 
gral  cell  averages  in  all  cells,  reconstruct  piecewise 
polynomial  coefficients  «•(„,,„)  for  use  in  equa¬ 
tion  (4.51).  For  solutions  containing  disconti¬ 
nuities  and/or  steep  gradients,  monotonieity  en¬ 
forcement.  may  be  required. 

(2)  Flux  Evaluation  on  Each  Edge:  Consider 
each  cell  boundary,  dej,  to  be  a  collection  of  edges 
(or  dual  edges)  from  the  mesh.  Along  each  edge 
(or  dual  edge),  perform  a  high  order  accurate  flux 
quadrature. 

(3)  Evolution  in  Each  Cell:  Collect  flux  con¬ 
tributions  in  each  cell  and  evolve  in  time  using 
any  time  stepping  scheme,  i.e.,  Euler  explicit,  Eu¬ 
ler  implicit,  Runge-Kutta,  etc.  The  result,  of  this 
process  is  once  again  cell  averages. 

By  far,  the  most  difficult,  of  those  steps  is 
the  polynomial  reconstruct  ion  given  cell  averages. 
In  the  following  paragraphs,  we  describe  design 
criteria  for  a  general  reconstruction  operator. 

Reconstruction 

The  reconstruction  operator  serves  as  a  finite- 
dimensional  (possibly  pseudo)  inverse  of  the  cell- 
averaging  operator  A  whose  j-tli  component  A, 
computes  the  ceil  average  of  the  solution  in  c, . 


In  addition,  we  place  the  following  additional  re¬ 
quirements: 

(1)  Conservation  of  the  mean:  Simply  stated, 
given  cell  averages  TI,  we  require  that  all  poly¬ 
nomial  reconstructions  uk  have  the  correct  cell 
average. 

if  uk  —  R*'7I  then  Ti  -  Auk 

This  means  that  II*  is  a  light  inverse  of  the  av¬ 
eraging  operator  A. 


AR*  =  /  (4.55) 

Conservation  of  the  mean  has  an  important  impli¬ 
cation.  Unlike  finite-clement  schemes,  Godunov 
schemes  ham:  a  diagonal  mass  matrix. 


0* 


(2)  k-exactiiess:  We  say  that,  a  reconstruction 
operator  ItA'  is  k- exact.  if  R*A  reconstructs  poly¬ 
nomials  of  degree  k  or  less  exactly. 

if  u  G  Vt  and  t7  =  An,  then  tt^'  =  R*?7  =  u 

In  other  words,  R*  is  a  left-inverse  of  A  restricted 
to  the  space  of  polynomials  of  decree  at  most  k. 


R*A 


=  / 


(4.50) 


Figure  4.0b  Piecewise  quadratic  reconstruction, 
Fltat  Kvaiuatiou 


This  insures  that  exact  solutions  contained  in  7\ 
are  in  fact  solutions  of  the  discrete  equations.  For 
sufliciently  smooth  solutions,  the  property  of  k- 
exactness  also  issmvs  that  when  piecewise  poly¬ 
nomials  are  evaluated  at  cell  boundaries,  the  dif¬ 
ference  between  solution  states  diminishes  with 
increasing  k  at  a  rate  proportional  to  h*f+1  were 
/>  is  a  maximum  diameter  of  the  two  cells.  Figure 
1.0a  shows  n  global  quart  ic  polynomial  u  £  P| 
which  has  been  averaged  in  eneh  int «vrval. 


Figure  i.tia  Ceil  averaging  of  quart  ic  polyno¬ 
mial. 


rile  task  here  is  t.o  evaluate  the  (lux  integral 
appearing  in  (4.51), 


At  cell  interfaces,  two  distinct  values  of  the  so¬ 
lution  can  be  obtained  anywhere  on  the  bound¬ 
ary  of  the  control  volume  by  direct  evaluation  of 
the  piecewise  polynomials  in  the  two  cells  sharing 
the  interface.  For  brevity,  the  states  will  be  de¬ 
noted  by  »+  and  u~  which  should  be  interpreted 
as  (uk)+  and  ( uK')~  whore  ±  refers  to  which  piece- 
wise  polynomial  was  used  in  the  evaluation.  Rather 
than  use  a  numerical  flux  function  derived  from 
the  exact  solution  of  the  Riomann  problem,  we 
prefer  numerical  flux  functions  based  on  moan 
value  linearizations  As  we  will  see,  this  actu 

ally  makes  certain  stability  proofs  much  clearer. 
Define  /(«,  n)  =  F(«)  •  n  and  <i(u,  n)  =  /(»,  n)', 
the  mean  value  flux  function  is  given  by 

h(  ,  u' ,  >0  =•  -  (/(«+ ,  n)  f  f(u  ,  n)) 

7  (4.58) 

-ijHMOl  («+  -  «") 

where  /(t,,+  )  -  /(«")  =  o(u.n)(u+  -  u~)  and 
fi  ==  0u~  -f  (1  ~0)u+  for  some  0  £  [0, 1].  Using  the 
numerical  flux  function,  we  approximate  (4.57)  by 


Figure  4.6b  shows  a  quadratic  reconstruction  uk  c- 
V-2  given  the  cell  averages.  Close  inspe*  ti«u  of 
figure  4.01)  reveals  small  jumps  in  the  piecewise 
polynomials  at  interval  boundaries.  These  jumps 
would  decrease  even  more  for  rubies  and  vanish 
altogether  for  quart-ic  reconstruc  tion.  Property 
(1)  requires  that  the  area  under  each  piecewise 
polynomial  is  exactly  equal  to  the  cell  average. 


«”,n  )dl 


In  practice,  this  flux  integral  is  never  evaluated 
exactly,  except  when  the  data  is  piecewise  con¬ 
stant.  When  piecewise  linear  functions  are  used,  a 
midpoint  quadrature  formula  is  usually  employed. 
This  is  used  rather  than  the  slightly  more  accu¬ 
rate  trapezoidal  quadrature  because  it  requires 
only  one  flux  evaluation  per  edge  segment  while 


fl- 


the  trapezoidal  quadrature  requires  two.  When 
considering  schemes  with  reconst  ruction  order  A 
greater  than  one,  wo  suggest  in  (41]  that.  Gauss 
quadrature  formulas  he  used.  Recall  that.  N  point 
Gauss  quadrature  founnlas  integrate  2Ar-  1  poly¬ 
nomials  exactly.  '1'hese  quadrature  formulas  give 
the  highest  accuracy  for  the  lowest  number  of 
function  evaluations.  For  the  A-cxaet  reconstruc¬ 
tion  discussed  below.  /V  >  (k  - i  l)/‘2  point  Gauss 
quadrature  formulas  arc  used. 

4.5.  k -exact. Reconstruction 

In  this  section,  a  brief  account,  is  given  of 
t  he  reconstruction  scheme  presented  in  Barth  ami 
Frederickson  [41]  for  arbitrary  order  reconst  ruc¬ 
tion.  Upon  first  inspection,  t  he  use  of  high  order 
reconstruction  appears  to  be  an  expensive  propo¬ 
sition.  The  present  reconstruction  strategy  opti¬ 
mizes  the  efficiency  of  the  reconstruction  by  pro- 
computing  as  a  one  time  preprocessing  step  the 
set  of  weights  W j  in  each  toll  Cj  with  neighbor 
set  A?  such  that 

Ojiu.ri)  ^  ^  Crt.iU'^'  (‘I . >)91 

,€A'-j 

where  <»(,„,„)  are  the  polynomial  coefficients.  This 
effectively  reduces  the  problem  of  reconstruction 
to  multiplication  of  predetermined  weights  and 
cell  averages  to  obtain  polynomial  coefficients. 

During  the  preprocessing  to  obtain  the  re¬ 
construction  weights  Wj  a  coordinate  system  wit  h 
origin  at  the  centroid  of  c}  is  assumed  to  min¬ 
imi/.*  roundoff  errors.  To  insure  that  the  recon¬ 
struction  is  invariant  to  afliuc  transformations,  we 
then  temporarily  transform  (rotate  and  scale)  to 
another  coordinate  system  (7,7?)  which  is  normal¬ 
ized  to  the  i  ell  rv 


.T 

D,.t 

D  i  .:> 

.r 

ii 

£>•>., 

D  •>_;> 

«/ 

with  the  matrix  D  is  chosen  so  that 
Aj(7*)  -  A }(f)  =  1 
A j(xy)  -  Aj(j/.r)  =  t) 

Polynomials  on  c,  are  temporarily  represented 
using  the  polynomial  basis  functions 

P  ~  [1.  7,  jy.7".-*?/*  J/‘*7'\  ...]. 


Note  that  polynomials  in  this  system  are  easily 
transformed  to  the  standard  cell-centroid  basis 


Since  0  <  s  4*  t  <  k  and  0  <  t»  -}-  n  -  s  -  t  v  k ,  we 
can  reorder  and  rewrite  in  terms  of  the  standard 
and  transformed  basis  polynomials 

^  (•  iii! it 1 1.1,0  (4-00) 

M-r<fr 


Satisfaction  of  conservation  of  the  mean  is  guar¬ 
anteed  by  introducing  into  the  transformed  coor¬ 


dinate  system  zero  mean  basis  polynomials  1}  in 
which  all  but  the  first  have  zero  cell  average,  i.e 

=  (1,7, 77, 7*  -  l,777,77-i  -  l,”'1  -  .d^C/1), ...]. 
No  ■  that  using  these  polynomials  requires  a  mi¬ 
nor  modification  of  (4.00)  hut  retains  the  same 
form: 


n . .  =  e  c 


(4.01) 


Given  tliis  preparatory  work,  we  are  now  ready 
to  desnibe  the  formulation  of  the  reconstruction 
algorithm. 


We  note  that  the  set  of  cell  neighbors  A/) 
must  contain  at  least,  (k  4-  1  )(A  4-  2)/2  cells  cv  if 
the  reconstruction  operator  R*‘  is  to  bo  A-oxart. 
That  (A  4-  1  )(A  4-  2)/2  cells  is  not  sulticient.  in  all 
situations  is  easily  observed.  If,  for  example,  the 
cell-centers  all  lie  on  a  single  straight  line  one  can 
find  a  linear  function  u  such  that  A j{u)  =  0  for 
every  cell  c_,,  which  means  that  reconstruction  of 
a  is  impossible.  In  other  cases  a  A-oxact  recon- 

ofi'iw'tiivp  O  ^  *»»*»\»  pvMof  K«i*  dim  ♦#,  4  )i.v 

v  »<.-«»  ''j'vimwi  imv  '  aa'M  urn,  w  /  tnr 

geometry  may  be  poorly  condit  ioned. 

Our  approach  is  to  work  with  a  slightly  larger 
support  containing  more  t  turn  the  minimum  num¬ 
ber  of  cells.  In  this  case  the  operator  R*  is  likely 
to  be  nonunique,  because  various  subsets  would 
he  able  to  support  reconstruction  operators  of  de¬ 
gree  A.  Although  all  would  reproduce  a  polyno¬ 
mial  of  degree  A  exactly,  if  we  disregard  round¬ 
off,  they  would  differ  in  their  treatment  of  non- 
polynomials,  or  of  polynomials  of  degree  higher 
than  A.  Any  A-exact  reconstruction  operator  Rj 
is  a  weighted  average  of  these  basic  ones.  Our  ap 
proach  is  to  choose  the  one  of  minimum  Fl  ohonius 
norm.  This  operator  is  optimal,  in  a  certain  sense, 
when  the  function  we  are  reconstructing  is  not.  ex¬ 
actly  a  polynomial  of  degree  A,  hut  one  that  hits 
been  perturbed  hv  the  addition  of  Gaussian  noise, 
for  it  minimizes  the  expected  deviation  from  the 
unperturbed  polynomial  in  a  certain  rather  nat¬ 
ural  norm. 

As  we  begin  the  formulation  of  the  recon¬ 
struction  preprocessing  algorithm,  the  render  is 


6-40 


reminded  that  the  task  at  hand  is  to  calculate  the 
weights  Wy  for  each  cell  Cj  which  when  applied 
via  (4.59)  produce  piecewise  polynomial  approx¬ 
imations.  We  begin  by  first  rewriting  the  piece- 
wise  polynomial  (4.53)  for  cell  c j  in  terms  of  the 
reconstruction  weights  (4.59) 

uk(x,y)  =  E  W( 

»n  +  tl<fc  I'eWcy 

or  equivalently 

uk(x}y)~  YTli  E 

Polynomials  of  degree  k  or  less  are  equivalently 
represented  in  the  transformed  coordinate  system 
using  zero  mean  polynomials 

/(*,*,)=  Y2  “<  E  "?«.■) iK^n)  (4.62) 

i€-VCj.  m4-n<fc 

Using  (4.61),  we  can  relate  weights  in  the  trans¬ 
formed  system  to  weights  in  the  original  system 

HW=  E  0*7™, „).<  (4-63) 

m+u<A: 

We  satisfy  ^-exactness  by  requiring  that  (4.62)  is 
satisfied  for  all  linear  combinations  of  (>(x,  y) 
such  that  s  +  i  <  k.  In  particular,  if  xik(x,  y )  = 
P(s,t)(x)  y)  fc>r  some  s  +  t  <  k  then 

E  WUn)>MK,t)) 

tn  +  n<k 

This  is  satisfied  if  for  all  s  H- 1,  m  -f  n  <  k 

E  -  Stnn 

<€ATC> 

Transforming  basis  polynomials  back  to  the  orig¬ 
inal  coordinate  system  we  have 

E  WU..)<  E  -  «s. 

iC-Vcj  u+u<fc 

(4.64) 

This  can  be  locally  rewritten  in  matrix  form  as 

W^A'  =  I  (4.65a) 

and  transformed  in  terms  of  the  standard  basis 
weights  via 

(4.655) 


Note  that  W'  is  a  (k  +  1  ){k  +  2)/2  by  Afj  matrix 
and  A'-  has  dimensions  A fj  by  (k  +  1  )(k  +  2)/2. 
To  solve  (4.65a)  in  the  optimum  sense  described 
above,  an  LjQj  decomposition  of  A'-  is  performed 
where  the  orthogonal  matrix  Q j  and  the  lower  tri¬ 
angular  matrix  Lj  have  been  constructed  using  a 
modified  Gram-Schmidt  algorithm  (or  a  sequence 
of  Householder  reflections).  The  weights  W'  are 
then  given  by 

w;  =  q -lj1 

Applying  (4.63)  these  weights  are  transformed  to 
the  standard  centroid  basis  and  the  preprocessing 
step  is  complete. 

We  now  show  a  few  results  presented  earlier 
in  reference  [41].  The  first  calculation  involves  the 
reconstruction  of  a  sixth  order  polynomial  with 
random  normalized  coefficients  which  has  been 
cell  averaged  onto  a  random  mesh.  Figures  4.7a- 
b  show  a  sample  mesh  and  the  absolute  L->,  error 
of  the  reconstruction  for  various  meshes  and  re¬ 
construction  degree. 


Figure  4.7a  Random  mesh. 


Figure  4.7b  L%  error  of  reconstruction. 
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The  reconstruction  algorithm  has  also  been 
tested  on  more  realistic  problems,  Figures  4.8a- 
c  show  a  mesh  and  reconstructions  (linear  and 
quadratic)  of  a  cell  averaged  density  field  corre¬ 
sponding  to  a  Ringleb  flow,  an  exact  hodograph 
solution  of  the  gasdynamic  equations,  see  [42]. 


Figure  4.8c  Piecewise  quadratic  reconstruction 
of  Ringleb  flow. 


4.6  Unwind  Advcction  Scheme  with  k  =  0 
Reconstruction 


Figure  4.8a  Randomized  mesh  for  Ringleb  flow. 


Figure  4.8b  Piecewise  linear  reconstruction  of 
Ringleb  flow. 

The  reader  should  note  that  the  use  of  piecewise 
contours  gives  a  crude  visual  critique  as  to  how 
well  the  solution  is  represented  by  the  piecewise 
polynomials.  The  improvement  from  linear  to 
quadratic  is  dramatic  in  the  case  of  Ringleb  flow. 
A  later  section  will  show  actual  numerical  solu¬ 
tions  computed  using  this  reconstruction  opera¬ 
tor. 


This  is  the  simplest  (first  order)  approxima¬ 
tion  in  which  the  polynomial  behavior  in  each  cell, 
c j,  is  a  constant  value  equal  to  the  cell  average. 

uK~°(x.y)  —  Uj  for  uk  <E  cj  (4.66) 

The  flux  formula  then  simplifies  to  the  follow¬ 
ing  form  (for  clarity  Tij  is  locally  numbered  u0 
as  shown  in  figure  4.1a) 

h(«+,«",n)j  h(uj,uo,  n,) 

+  /('«;, n,))  (467) 
-i|«(u,, n,)|(ili  -no) 
r/ 

In  this  formula,  n;  —  +  for  any  simple  path. 

t—  1/3 

Dy  summing  over  all  edges  of  the  control  volume, 
the  entire  scheme  for  Cj  is  written 

d  f  _  rf(c,)  1 

—  I  u0  </«-f  2  “<)  +  /(«<>"•)) 

-  >3  ola(fi»  ®*)l  (h.  -  «o)  =  0 
i=i  * 

(4.68) 

It  is  not  difficult  to  prove  stability  and  monotonic¬ 
ity  of  this  scheme. 
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Recall  that  the  flux  function  was  constructed 
from  a  mean  value  linearization  such  that 

/(«<,  n.)  -  /(«o,  n<)  =  «(«i,  M.)  («.'  -  «o)  (4.69) 

with  «i  =  0uo+(l-0)«i.  6  £  [C,  lj.  This  permits 
regrouping  terms  into  the  following  form: 

d  r  d(Y 

—  /  u0da  +  E  ii.»o,ni) 

dt  4  tx 

<*(*<)  j 

+  E  2  («(“<>“»■)  ~  |«(tti,n,)|)  (Ui  -  «o)  =  0 

(4.70) 

For  any  closed  control  volume,  we  have  that 

d(c,) 

E  /0*n,  Mf)  =  0 

i=l 

Combining  the  remaining  terms  yields  a  final  form 
for  analysis  (a  =  «+  4-  a-,  |«|  =  a+  —  a“): 

—  /  uorfa+r  o(uMfli)-(«,-«o)  =  0  (4.71) 

4  i=i 

To  verify  the  monotonicity  of  the  scheme  ai  steady 
sta-e,  set  the  time  term  to  zero  and  solve  for  «o. 

*  -  (4-72) 

All  weights  <v,  are  positive  and  sum  to  unity.  The 
scheme  is  monotone  since  u'o  is  a  positive  weighted 
average  of  all  neighbors.  This  implies  a  maximum 
principle  since  7/q  is  bounded  from  above  and  be¬ 
low  by  the  maximum  and  minimum  of  neighbor¬ 
ing  values  (and  itself),  umax  and 


Mm»n  5:  H0  £  ?hnni 


(4.7-3) 


For  explicit  time  stepping,  a  CFL-like  condition 
is  obtained  for  monotonicity.  For  Euler  explicit 
time  stepping,  we  have  the  time  approximation, 


0  1  ,  w 

mXJ,°d 


w'H-l  _  57 « 
“0 


which  results  in  the  following  scheme: 


«0+1  =«o  -  7"  E  «(«.-. «.-)'(«,”  -  «o") 

i=i 


d(Ci) 

-  E 


It  should  be  clear  that  coefficients  in  (4.74)  sum  to 
unity.  To  prove  monotonicity  in  time  and  space, 
it  is  sufficient  to  show  positivity  of  coefficients. 
By  inspection  we  have  that  o,  >  0  V  i  >  0.  To 
guarantee  monotonicity  requires  that  oto  >  0. 


At  i 

ft»  =  1  +  7-Ea(“*’"*r  (4-75) 

c  i=  1 

Thus,  a  CFL-like  condition  is  obtained  which  in¬ 
sures  monotonicity  and  stability. 


A  t<~ 


(4.76) 


Note  that  in  one  dimension,  this  number  corre¬ 
sponds  to  the  conventional  CFL  number.  In  mul¬ 
tiple  space  dimensions,  this  inequality  is  sufficient 
but  not  necessary  for  stability.  In  practice  some¬ 
what  larger  timestep  values  may  be  used. 

Conclusion:  The  upwind  algorithm  (4-68)  using 
piecewise  constant  data  satisfies  a  discrete  maxi¬ 
mum  principle  for  general  unstructured  meshes. 

4.7  Unwind  Advection  Schemes  with  Linear 
(k  =  1)  Reconstruction 

In  this  section,  we  consider  advection  schemes 
based  ou  linear  reconstruction.  The  process  of 
linear  reconstruction  in  one  dimension  is  depicted 
in  figure  4.9.  I 


Ill. 


(4.74) 


Figure  4.9  Linear  Reconstruction  of  cell-averaged 
data. 

One  of  the  m  <>st  important  observations  concern¬ 
ing  linear  reconstruction  is  that  we  can  dispense 
with  the  notion  of  cell  averages  as  unknowns  by 
reinterpreting  the  unknowns  as  pointwise  values 
of  the  solution  sampled  at  the  centroid  ( midpoint 
in  1-D)  of  the  control  volume.  This  well  known 
result  greatly  simplifies  schemes  based  on  linear 
reconstruction.  The  linear  reconstruction  in  each 


interval  shown  in  figure  4.9  was  obtained  by  a  sim¬ 
ple  central-difference  formula  given  point  values 
of  the  solution  at  the  midpoint  of  each  interval. 

In  section  4.3,  results  for  the  Ringlcb  flow 
with  linear  reconstruction  were  presented.  The 
reconstruction  strategy  presented  there  satisfies 
ail  the  design  requirements  of  the  reconstruction 
operator.  For  linear  reconstruction,  simpler  for¬ 
mulations  are  possible  which  exploit  the  edge  data 
structure.  Several  of  these  reconstruction  schemes 
are  given  below.  Note  that  for  steady-state  com¬ 
putations,  conservation  of  the  mean  in  the  data 
reconstruction  is  not  necessary.  The  implication 
of  violating  this  conservation  is  that  a  nondiag¬ 
onal  mass  matrix  appears  in  the  time  integral. 
Since  time  derivatives  vanish  at  steady-state,  the 
effect  of  this  mass  matrix  vanishes  at  steady-state. 
The  reconstruction  schemes  presented  below  as¬ 
sume  that  solution  variables  are  placed  at  the 
vertices  of  the  mesh,  which  may  not  be  at  the 
precise  centroid  of  the  control  volume,  thus  vio¬ 
lating  conservation  of  the  mean.  The  schemes  can 
all  be  implemented  using  an  edge  data  structure 
and  satisfy  k-exactness  for  linear  functions. 

4.7a  Green-Gauss  Reconstruction 

This  reconstruction  exploits  the  gradient  cal¬ 
culation  (4.19)  studied  earlier  in  section  4.3: 


where  no;  is  the  vector  normal  associated  with 
the  edge  e(i>o>v;)-  This  approximation  extends 
naturally  to  three  dimensions,  see  Barth  [43]. 

4.7b  Linear  Least-Squares  (L->)  Reconstruction 

To  derive  this  reconstruction  technique,  con¬ 
sider  a  vertex  Vo  and  suppose  that  the  solution 
varies  linearly  over  the  support  of  adjacent  neigh¬ 
bors  of  the  mesh.  In  this  case,  the  change  in  ver¬ 
tex  values  of  the  solution  along  an  edge  e(w;,i>o) 
can  be  calculated  by 

(Vuft)o  ■  (R,-  -  Rq)  =  iq  —  ti0  (4.78) 

This  equation  represents  the  sealed  projection  of 
the  gradient  along  the  edge  c(vj,v0).  A  similar 
equation  could  be  written  for  all  incident,  edges 
subject  to  an  arbitrary  weighting  factor.  The  re¬ 
sult  is  the  following  matrix  equation,  shown  here 
in  three  dimensions: 

ttqAxi  uqAj/j  uqAzi 
w„Ax„  wtjAyJt  wnAzn 

(4.79) 


/«A  /«>i(ui-«o)' 
V  zf  \W„(u„-Uo). 
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or  in  symbolic  form  £  Vu  =  f  where 

£=[£,  L2  L3]  (4.80) 

in  three  dimensions.  Exact  calculation  of  gradi¬ 
ents  for  linear  u  is  guaranteed  if  any  three  row 
vectors  ttfj(R,-  -  Rq)  span  all  of  3  space.  This  im¬ 
plies  linear  independence  of  L1;  L2,  and  £3.  The 
system  can  then  be  solved  via  a  Gram-Schmidt 
process,  i.e., 

'V,  .  fl  0  0‘ 

V2  [Li  L2  L3]  =  0  10  (4.81) 

V3J  [0  0  1 

The  row  vectors  V,  are  given  by  V,'  — 
where 

Ul  =  (*33*22  -  *23*23)^1  —  (*33*12  “  *23*13)h2 
(*22*13  —  *23*12)U3 

U2  =033*11  “  *13*13)^2  “  (*33*12  -  *13*23)Bi 
-(*11*23  "  /l3*l2)L;l 

U3  =(*11*22  -  *12*12)^3  -  (*22*13  -  *12*23)Li 
-(*11*23  “  *12*13)£2 

and  lij  —  (L,-  •  £_,). 

Note  that  reconstruction  of  N  independent 
variables  in  R'  lies  (dtl)  +  d  JV  inner  product 
sums.  Since  aly  d  N  of  these  sums  involves  the 
solution  variau..  s  themselves,  the  remaining  sums 
could  be  precalculated  and  stored  in  computer 
memory.  This  makes  the  present  scheme  compet¬ 
itive  with  the  Green-Gauss  reconstruction.  Using 
the  edge  data  structure,  the  calculation  of  inner 
product  sums  can  bo  calculated  for  arbitrary  com¬ 
binations  of  polyhedral  cells.  In  all  cases  linear 
functions  are  reconstructed  exactly.  We  demon¬ 
strate  this  idea  by  example: 

For  k  =  l,n(e)  /  Loop  through  edges  of  mesh 
j i  =  e-1  (fc,  1)  !  Pointer  to  edge  origin 

32  =  e~l(k,2)  !  Pointer  to  edge  destination 

dx  —  w(k)  •  (x(y2)  -  ic(  >i ))  /  Weighted  Ax 

dy  =  u>(k)  ■  (y{j2)  -  y(ji))  !  Weighted  Ay 
*n(ji)  =  hi(ji)  +  dx -dx  !  lu  orig  sum 
*  1 1 0*2 )  =  *11  (72 )  +  dx  •  dx  !  *11  dest  sum 
*12(7*1)  =  *12(71)  +  dx  dy  !  *12  orig  sum 
*12(72)  =  *12(72)  +  dx  •  dy  !  *12  dest  sum 


du  =  w(k)  •  (u(j2)  -  «(ji))  !  Weighted  Au 

*113(71 )  =  *113(71)  +  dz  •  du  !  lus  orig  sum 
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lu^(j2)  =  lu2(j2)  +  dz  •  du  !  I113  dest  sum 

Endfor 

This  formulation  provides  freedom  in  the  choice 
of  weighting  coefficients,  These  weighting 
coefficients  can  be  a  function  of  the  geometry 
and/or  solution.  Classical  approximations  in  one 
dimension  can  be  recovered  by  choosing  geomet¬ 
rical  weights  of  the  form  u>,  =  l./|Rt-  -  Ro|*  for 
values  of  t  =  0, 1, 2.  The  L2  gradient  calculation 
technique  is  optimal  in  a  weighted  least  squares 
sense  and  determines  gradient  coefficients  with 
least  sensitivity  to  Gaussian  noise.  This  is  an  im¬ 
portant  property  when  dealing  with  highly  dis¬ 
torted  (stretched)  meshes. 

4.7c  Data  Dependent  Reconstruction 

Both  the  Green-  Gauss  and  L2  gradient  calcula¬ 
tion  techniques  can  be  generalized  to  include  data 
dependent  (i.e.  solution  dependent)  weights.  In 
the  case  of  Green-Gauss  formulation,  the  sum 

53  J(«o  +i<i)n0,- 

■  €l o 


is  replaced  by 

53  Po.  ^wo+u.Ono.+p?,  ^  ((Vti)o  •  (R,  -  Ro))  no. 

(4.S?) 

If  the  p*  arc  chosen  such  that  Pq.  4-  ;>q(  =  1  then 
the  gradient  calculation  is  exact  whenever  the  so¬ 
lution  varies  linearly  over  the  support.  In  two 
space  dimensions,  equation  (4.82)  implies  the  so¬ 
lution  of  a  linear  2x2  system  of  the  form 


A0  - 
-myx 


~m*v  Y u*\  _ 

-  ™yyJW 


Po.2(tto+u')“')' 


where 


m 


XX 


—  5>o,A*,«,„ 

t'€Io 


ms.v  =  53  PoAvShi 

«€l  o 


-  5Z  PoAXinVH  mVX  =  5Z  PoAVi71*. 

*  €  io 

Core  must  be  exercised  in  the  selection  of  p±  in 
order  that  the  system  be  invertible,  This  is  sim¬ 
ilar  to  the  spanning  space  requirement  of  the  L2 
gradient  calculation  technique. 

4.7d  Monotonicitv  Enforcement 


to  prevent  oscillations  from  developing  in  the  nu¬ 
merical  solution.  One  way  to  do  this  was  pio¬ 
neered  by  van  Leer  (35]  in  the  late  1970’s.  The 
basic  idea  is  to  take  the  reconstructed  piecewise 
polynomials  and  enforce  strict  monotonicity  in 
the  reconstruction.  Mouotouicity  in  this  context 
should  be  interpreted  to  mean  that  the  value  of 
the  reconstructed  polynomial  does  not  exceed  the 
minimum  and  maximum  of  neighboring  cell  av¬ 
erages.  In  other  words,  the  final  reconstruction 
must  guarantee  that  no  new  extrema  have  been 
created.  This  will  be  referred  to  as  ‘monotonicity 
property  1.’  When  a  new  extremum  is  produced, 
the  slope  of  the  reconstruction  in  that,  interval 
is  reduced  until  monotonicity  is  restored.  This 
implies  that  at  a  local  minimum  or  maximum  in 


Figure  4.10  Linear  Reconstruction  with  mono¬ 
tone  limiting. 


Another  property  (referred  to  hereafter  as  ‘prop¬ 
erty  2’)  of  the  monotonicity  enforcement  is  mo¬ 
tivated  by  the  stability  proof  associated  wit1,  the 
higher  order  accurate  schemes  (presented  in  sec¬ 
tion  4.7e).  In  one  dimension,  property  2  can  be 
characterized  as  the  requirement  that  the  new  re¬ 
construction  not  produce  a  reconstructed  solution 
variation,  f  |rfu|,  which  is  larger  than  the  piece- 
wise  constant  value.  If  property  2  is  violated  then 
the  slopes  must  be  reduced  until  the  solution  vari¬ 
ation  is  satisfied.  This  situation  is  depicted  in 
figure  4.11.  For  arbitrary  unstructured  grids,  a 
sufficient  condition  is  that  the  differences  in  the 
extrapolated  states  at  a  cell  interface  quadrature 
point  be  of  the  same  sign  as  the  difference  in  the 
piecewise  constant  values,  i.e. 


=r — rr  >  0,  (property  2) 
tr  -  u 

when  combined  with  property  1,  the  following  in¬ 
equality  exists: 


1  > 


>  0 


When  solution  discontinuites  and  steep  gradi¬ 
ents  as  present,  additional  steps  must  be  taken 


(4.83) 
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This  inequality  is  crucial  in  the  stability  proof 
given  below, 


Figure  4.11  (a)  Reconstruction  profile  with  in¬ 
creased  variation  violating  monotouicity  property 
2.  (b)  Profile  after  modification  to  satisfy  mono¬ 
tonicity  property  2. 

In  Barth  and  Jespersen  [40],  we  gave  a  simple- 
recipe  for  invoking  property  1.  Consider  writing 
the  linearly  reconstructed  data  in  the  following 
form: 

«*(*.  y)j  =  Uj  +  v II*  ■  (R  -  Rj )  (4.84a) 

Now  consider  a  “limited”  form  of  this  piecewise 
linear  distribution. 

«*■’(*>  y)i  =  «j  H'  Vu*  ■  (R  -  R j)  (4.84fc) 

The  idea  is  to  find  the  largest  admissible  while 
invoking  a  monotonicity  principle  that  values  of 
the  linearly  reconstructed  function  must  not  ex¬ 
ceed  the  maximum  and  minimum  of  neighboring 
centroid  values  (including  the  centroid  value  in 
Cj).  To  do  this,  first  compute 

Uy  —  min(li  j,  Uneighbors) 


and 

llj  ~  max(«j,  UHCighbor*) 
then  require  that 


ufin  <  u(xty)j  <  (4.85) 

For  linear  reconstructions,  extrema  in  u(x,  y)1- 
occur  at.  the  vertices  of  the  control  volume  and 
sufficient  conditions  for  (4.85)  can  be  easily  ob¬ 
tained.  For  each  vertex  of  the  cell  compute  u;,= 
uk(xi,  j/i)j,  t  =  1  ,NCi  to  determine  the  limited 
value,  <f>i)  which  satisfies  (4.84): 


<f>i  = 


{min(l, 

min(l, 

1 


u?a*-Th 


u  i- 
um*n 


Ui- 


),  if  Ui  —  Uj  >  0 

),  if  Ui  —  Tij  <  0 

if  Ui  —  Tij  =  0 


with  <&j  =  ).  In  practice, 

the  reconstructed  polynomial  may  be  calculated 
at  the  flux  quadrat, •u-’  points  instead  of  the  ver¬ 
tices  of  the  control  volume  with  a  negligible  degra¬ 
dation  in  Monotouicity.  In  the  implementation  of 
property  2,  we  prefer  a  “symmetric”  reduction 
of  slopes.  In  other  words,  at  interfaces  violating 
property  2,  both  of  the  two  cells  sharing  that  in¬ 
terface  reduce  their  slope  until  (4.83)  is  satisfied. 

When  the  above  procedures  are  combined  with 
the  flux  function  given  earlier  (4.58), 


h(«+ ,  u  ,  a)  =  -  (f(u+ ,  n)  +  /(«  ,  n)) 

-4la(«>n)l(«+  -«") 


(4.58) 


the  resulting  scheme  has  very  good  shock  resolv¬ 
ing  characteristics.  To  demonstrate  this  fact,  we 
consider  the  scalar  nonlinear  hyperbolic  problem 
suggested  by  Struijs,  Deconinck,  et  al  [44].  The 
equation  is  a  multidimensional  form  of  Burger’s 
equation. 


ut  +  ( u2/2)*  +  uy  =  0 


Wc  solve  the  equation  in  a  square  region  [0, 1.5]  x 
[0,1.5]  with  boundary  conditions:  u(x,  0)  =  1.5- 
2x,  x  <  1,  u(x, 0)  =  —.5,  x  >  1,  u(0,  y)  =  1.5, 
and  u(1.5,y)  =  -.5.  Figures  4.12  and  4.13  show 
carpet  plots  mid  contours  of  the  solution  on  reg¬ 
ular  and  irregular  meshes. 


Figure  4.12a  Carpet  plot  of  Burger’s  equation 
solution  on  regular  mesh. 


6 -Hi 


Figure  4.12b  Solution  Contours  on  regular  mesh. 


Note  that  the  carpet  plots  indicate  that  the  nu¬ 
merical  solution  on  both  meshes  is  monotone.  Even 
so,  most  people  would  prefer  the  solution  on  the 
regular  mesh.  This  is  an  unavoidable  consequence 
of  irregular  meshes.  The  only  remedy  appears  to 
be  mesh  adaptation.  Simf’ar  results  for  the  Euler 
equations  will  be  shown  on  irregular  meshes  in  a 
future  section. 

4.7e  Stability  Analysis  via  Energy  Methods 

Consider  once  again  the  local  mesh  shown  in 
figure  4.1a  with  local  index  about  a  vertex  vq.  In 
the  analysis  performed  below,  we  consider  energy 
stability  of  schemes  of  the  following  form 

d 

— ti0>io  =  -  h(u+,«“,n)0,-  (4.86) 

i=l 

using  linear  reconstruction  with  limiting.  Note 
that  in  this  analysis  all  boundary  effects  will  be 
ignored.  In  section  4.5,  stability  of  the  first  or¬ 
der  upwind  scheme  was  proven  using  monotonic¬ 
ity  analysis.  Before  considering  the  higher  order 
schemes,  we  briefly  digress  to  show  stability  of  the 
first  order  upwind  scheme  using  energy  methods. 
Using  the  same  techniques,  energy  stability  of  the 
high  order  schemes  with  reconstruction  and  lim¬ 
iting  will  be  shown. 

Energy  Analysis  for  the  k  =  0  scheme 

In  this  case,  the  flux  takes  the  simple  form  and 
the  scheme  for  a  single  vertex  Vg  can  be  written 
as  indicated  below 


Figure  4.13a  Carpet  plot  of  Burger’s  equation 
solution  on  irregular  mesh. 


■l(cj) 

-^2  “«')l  -  «o)  =  o 


(4.87) 


or  in  symbolic  operator  form,  where  u  denotes 
the  solution  vector,  i.e.  u  =  - 

In  this  symbolic  form,  the  scheme  is  written  as 


(Du)i  +  Ca u  -  Cd u  =  0  (4.88) 

where  D  is  a  positive  diagonal  matrix  cont  ining 
the  area  of  each  control  volume.  Ca  and  C,  repre¬ 
sent  the  advective  and  diffusive  operators  in  this 
linear  scheme.  The  energy  of  the  system  ( 1.88)  is 
given  by  the  following  equation: 

(u7  D0u)t+uT(£„  +  C]')  u-u T(Cj  +  Tj)  u  =  0 

(4.89) 


It  is  a  straightforward  exercise  to  show  that  in 
tlie  linear  case,  C„  and  Ct[  are  skew-symmetric 
(isoenergetic)  operators,  hence 

«'  (A, +<!')  u  =  0. 

The  diffusive  operator  C,i  is  symmetric  which  re¬ 
duces  the  energy  equation  to  the  following  form: 

~  (u7'Z>0u),  -  n'  Caii  =  0  (4.90) 

FVom  symmetry  and  application  of  the  eigenvalue 
circle  theorem,  it  is  easily  shown  that  C,t  is  a 
symmetric,  negative  semi-definite  matrix  opera¬ 
tor  which  implies  that 

u rCdu  <  0 

for  all  u.  This  establishes  that  the  scheme  is  en¬ 
ergy  stable  since 

(u7  D0u)l  <  0 


From  symmetry  and  the  eigenvalue  circle  theorem 
we  have  that 

u 7  <  0  (4.92) 

It  remains  to  be  shown  that  the  advcction  oper¬ 
ator  C„  is  cither  isoenergetic  (in  the  linear  case) 
or  decays  energy  in  the  system.  Not  all  extrap¬ 
olation  formulas  guarantee  that  this  is  true.  A 
full  discussion  of  this  topic  is  beyond  the  scope  of 
these  notes  and  is  a  subject  of  current  research. 
Note  that  in  reference  [45],  we  indicated  a  pref¬ 
erence  for  a  standard  Galerkin  discretization  of 
j C„.  Since  this  operator  is  isoenergetic,  when  com¬ 
bined  with  the  diffusion  operator  described  above, 
the  entire  scheme  is  provably  stable  in  an  energy 
norm. 

4.8  Maximum  Principles  and  the 
Delannav  Triangulation 


Energy  Analysis  for  the  k  —  1  scheme. 

We  now  consider  the  advection  scheme  with  lin¬ 
ear  reconstruction.  The  interface  states  for  the 
edge  of  the  control  volume  separating  cells  Co  and 
c,  are  denoted  by  Uq  and  u~ ,  respectively.  The 
scheme  is  written  in  the  familiar  form: 

(Ao«o),-F  Y  2  M“o  ’  +  /(«r .«>)) 


£«»< 

«',)  j 

-  Y  2  !"(«**  »*)!  («r  -  «o )  =  o 


(4.91) 

Consider  rewriting  equation  (4.91)  using  the  iden¬ 
tity 


»"  ~  uo  =  ( ~= — A—)  («•  -  mo)  =  -  i<„) 

\  «t  -  «0  / 


which  tacitly  assumes  that  the  ratio  exists.  Mono- 
tonicity  properties  1  and  2  guarantee  that  \}>  € 
[0,1].  Thus,  equation  (4.91)  is  rewritten  in  the 
nonlinear  form: 


n.) +  /(«,■ ,«,)) 


j 

-  Y  “<)!(«••  ~  «o)  =  0 


The  edge  formulas  presented  earlier  not  only 
provide  an  efficient  procedure  for  calculating  quan¬ 
tities  such  as  the  gradient  and  divergence,  but 
also  provide  certain  theoretical  results  which  are 
difficult  to  ascertain  otherwise.  For  example,  Cia- 
rlct  and  Raviart  [46]  consider  Galerkin  schemes 
for  solving  elliptic  equations  using  linear  finite- 
elements.  They  derive  sufficient  conditions  for 
the  existence  of  a  discrete  maximum  principle  for 
Laplace’s  equation  if  all  angles  in  the  triungula- 
tion  are  less  than  n/2  -  e  for  some  positive  c. 
Using  the  edge  formulas  derived  in  section  4.3, 
sufficient  and  necessary  conditions  can  be  derived 
for  a  discrete  maximum  principle  which  are  quite 
different,  from  the  Ciarlct  result.  A  brief  outline 
of  the  proof  is  given  below. 


Example:  Derive  conditions  for  a  discrete  max¬ 
imum  principle  using  a  Galerkin  approximation 
with  linear  elements. 

Using  a  reduced  form  of  (4.43),  the  canonical 
edge  formula  for  the  discrete  Laplacian  operator 
is  given  by 


f  <f>Auhda  =  C(uh)vt  = 

Jflo 

j 

2r[cotan(a£j)  +  cotan(a/tj )]  (m,  -  u0) 


i€l o 


(4.93) 

where  the  angles  a£>  and  a:e  depicted  below. 
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The  key  elements  of  the  proof  are  given  below: 
Rearrangement  of  the  weights  appearing  in  (4.93) 
yields 


Figure  4.14  Circumcircic  test  for  adjacent  tri¬ 
angles. 

It  is  well  known  that-  a  discrete  maximum  prin¬ 
ciple  exists  for  arbitrary  point  distributions  and 
boundary  data  if  and  only  if  the  discrete  operator 
is  a  nonnegative  operator,  i.e.,  if 

=  5Z  «V«''  (4.94) 

ier<» 


Wi  =  ^  [<:otan(«/^ )  +  cotan(orftj)] 

_1  [cos  (a  Lj)  cos  (0,01 
2  [sin («/,,)  sin (««,)] 

=  I  [  s\n(alji  +aui)  1 
“2  [sin(oti)sin(«ni)J 


(4.98) 


Since  o/,.  <  tt,  cm-  <  it,  the  denominator  is 
always  positive  and  nonnegativity  requires  that 
or /,.  -f  <  7r.  Some  trigonometry  reveals  that 

for  the  configuration  of  figure  4.14  with  circumcir- 
cle  passing  through  the  sum  an,  + 

cni,.  depends  on  the  location  of  u;_t  with  respect 
to  the  circumcircle  in  the  following  way: 


o Hi  4-  o/,i  <  ic,  t»,_i  exterior 

o/fi  +  o/ii  >  7r,  t?i_j  interior  (4.99) 

Oh,  -f  oi,  =  rr,  «,•„!  cocircular 


and 

w0  <  0,  u>i  >  0,  j  >  0,  w0  +  w'  ~  0  (4.95) 

ia  o 

for  any  interior  vertex  no-  For  schemes  of  the  form 

£(«*)».  =  £  ^,(«*  "  «o)  (4  %) 

* €  Xo 


nonnegativity  requires  that  IF,  >  0  for  all  i  6  In- 
This  guarantees  a  maximum  principle.  Equating 
equation  (4.96)  to  zero,  we  obtain 


„ .  e,6tc  'M 
°"  IF, 


(4.97) 


and  therefore 


min («j ,«2, «J}#)  <  «o  <  max  (u£,u5,. ..,«£,) 

A  natural  question  to  be  addressed  concerns  the 
existence  and  uniqueness  of  triangulations  of  an 
arbitrary  point  set  such  that  (4.93)  guarantees  a 
discrete  maximum  principle.  In  two  dimensions 
a  unique  triangulation  always  exists.  The  main 
result  is  summarized  in  the  following  theorem: 

The  discrete  Laplacian  operator  (4-93)  exhibits 
a  discrete  maximum  principle  for  arbitrary  point 
sets  in  two  space  dimensions  iff  the  triangulation 
of  these  points  is  a  Delaunay  triangulation. 


Also  note  that  we  could  have  considered  the  cir- 
cumcirclo passing  through  {v0)  t>,-,  w;-i}  with  sim¬ 
ilar  results  for  »,•+(.  The  condition  of  nonnegativ¬ 
ity  implies  a  circumcircle  condition  for  all  pairs  of 
adjacent  triangles  whereby  the  circumcircle  pass¬ 
ing  through  either  triangle  cannot  contain  the 
fourth  point.  This  is  precisely  the  unique  char¬ 
acterization  of  the  Delaunay  triangula  .jn  which 
would  complete  the  proof. 

Keep  in  mind  that  from  equation  (4.98)  we 
have  that  cotan(a)  >  0  if  a  <  7t/2.  Therefore 
a  sufficient  but  not  necessary  condition  for  non¬ 
negativity  (and  a  Delaunay  triangulation)  is  that 
all  angles  of  the  mesh  be  less  than  or  equal  to 
7i/2.  This  is  a  standard  result  in  finite  element 
theory  and  applies  in  two  or  more  space  dimen¬ 
sions.  The  construction  of  nonnegative  operators 
lias  important  implications  with  respect  to  the  di¬ 
agonal  dominance  of  implicit  schemes,  the  eigen¬ 
value  spectrum  of  the  discrete  operator,  stability 
of  relaxation  schemes,  etc. 

We  can  ask  if  the  result  concerning  Delaunay 
triangulation  and  the  maximum  principle  extends 
to  three  space  dimensions.  As  we  will  show,  the 
answer  is  no.  Section  4.3d  gives  the  correspond¬ 
ing  edge  formulas  for  Hessian  and  Laplacian  dis¬ 
cretizations  in  3-D.  The  resulting  formula  for  the 
three  dimensional  Laplacian  is 

/  4>h  VV  dv~Y]  W<(u,  -  ti„)  (4.100) 

Jv''o 


fi-IV 


IF.  ~  -  Y]  |ARfc+l/2|  cotaii(«M.|/2). 

0  *=i 

(4.101) 

In  t  his  formula  VV0  is  the  volume  formed  by  the 
union  of  ill  tetrahedra  that  share  vertex  r0.  Jo  is 
the  sot  of  indices  of  all  adjacent  neighbors  of  va 
couuectcd  by  incident-  edges,  k  a  local  cyclic  in¬ 
dex  describing  the  associated  vertices  which  form 
a  polygon  of  degree  d(r©,  t>j)  surrounding  the  edge 
e(t>o ,t\),  cxi t+j/2  is  the  face  angle  between  the 
two  faces  associated  with  Sr+1/2  and  s'k+  which 
share  the  edge  c(ojt,  vjt-n)  and  | 1  /2I  *s  the 
nuvgnitude  of  the  edge  (see  figure  below). 


Figure  4.15  Sot-  of  tetrahedra  sharing  int  erior 
edge  c(<;0,  v,j  with  local  cyclic  index  k. 

A  maximum  principle  is  guaranteed  if  all  IV,  >  0, 
We  now  will  procedc  to  describe  a  valid  Delau¬ 
nay  triangulation  with  one  or  more  Wj  <  0.  It 
will  suffice  to  consider  the  Delaunay  triangulation 
of  N  points  in  which  a  single  point  oa  lies  inte¬ 
rior  to  the  triangulation  and  the  remaining  N  -  1 
points  describe  vertices  of  boundary  faces  which 
completely  cover  the  convex  hull  of  the  point  set. 
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Consider  a  subset  of  the  N  vertices,  in  particular 
consider  an  interior  edge  incident-  to  t>o  connect¬ 
ing  to  v,  as  shown  in  figure  4.16  by  the  dashed 
line  segment  and  ali  neighbors  adjacent  to  t);  on 
the  hull  of  the  point-  set.  In  this  experiment  we 
consider  the  height  of  the  interior  edge,  z,  as  a 
free  parameter.  Although  it  will  not  be  proven 
here,  the  remaining  N  —  8  points  can  be  placed 
without  conflicting  with  any  of  the  conclusions 
obtained  for  looking  at-  the  subset- 

ft  is  known  that  a  necessary  and  sufficient-  con¬ 
dition  for  the  3-D  Delaunay  triangulation  is  that- 
the  circumsphcrc  passing  through  the  vertices  cf 
any  tetrahedron  must,  be  point  free  (21);  that  is  to 
say  that,  no  other  point  of  the  triangulation  can 
lie  interior  to  this  sphere.  Furthermore  a  property 
of  locality  exists  [21]  so  that  we  need  only  inspect 
adjacent,  tetrahedra  for  the  satisfaction  of  the  cir* 
cumsphere  test-  For  the  configuration  of  points 
shown  in  figure  4.16,  convexity  of  the  point  cloud 
constrains  z  >  1  and  the  satisfaction  of  the  cir- 
emnsphere  test  requires  that.  2  <  2. 

1  <  z  <  2  (Delaunay  Triangulation) 

From  (2.13)  wo  find  that  Wi  >  0  if  and  only  if 
2  <  7/4. 

1  <  z  <  -j,  (Nonnegativity) 

This  indicates  that  for  7/4  <  ?.  <  2  we  have  a 
valid  Delaunay  triangulation  which  does  not  sat¬ 
isfy  a  discrete  maximum  principle,  In  fact-,  the 
Delaunay  triangulation  of  400  points  randomly 
distributed  in  the  unit  cube  revealed  that  approx¬ 
imately  25%  of  the  interior  edge  weights  were  of 
the  wrong  sign  (negative). 

Keep  in  mind  that  from  (4.101)  wo  can  obtain  a 
sufficient  but  not  necessary  condition  for  nonneg¬ 
ativity  that  all  face  angles  be  less  than  or  equal 
to  7r/2. 

The  formulas  for  the  prototypical  viscous  term 
V  •  /«Vtt  are  only  slightly  more  complicated  than 
the  Laplacian  formulas.  In  2-D  we  have  the  fol¬ 
lowing  weights 

Vri  =  \  [7*/,.<  °bm(o,.,)  -I-  r*H,cotan(aHi)] 

(4,102) 

or  in  3-D 


wi  =  £  53  frt-+i/2|AR*+)/2|  eotan(tt*+,/2) 

'  k=  1 

(4.103) 


Figure  4.16  Subset  of  3-D  Delaunay  TViangula- 
t-ion  which  does  not  maintain  nonnegativity. 
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where  ft  is  the  average  value  of  p  in  the  specified 
simplex.  Since  p  and  jl  are  always  assumed  posi¬ 
tive  quantities,  we  have  the  following  theorem: 

A  discrete  maximum  principle  associated  with 
the  discretization  of  V  •  /iVn  with  weights  given 
by  (4.102)  and  (4. 108)  is  guaranteed  iff  Wi  >  0 
for  all  interior  edges  of  the  mesh.  A  sufficient  but. 
not  necessary  condition  is  that  all  angles  (2-D)  or 
faces  angles  (3-D)  be  less  than  or  equal  to  tt/2. 

The  proof  follows  immediately  from  nonnegativ¬ 
ity  of  (4.102)  and  (4.103).  The  sufficient  but  not 
necessary  condition  is  a  minor  extension  of  the 
result  by  Ciarlct  [40). 
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u  do  4*  /  F(u)  *  11  dl  =  0 


(5.5) 


with 


(  i'\  (  tfV.n)  \ 

u  —  I  f>V  ,  F(*i)  ■  11  -  pV(V  11)  +  pn 

\E  )  \(E  +  p)(V.n)  J 


Iti  the  next-  section,  we  show  the  natural  ex¬ 
tension  of  the  scalar  advection  scheme  to  include 
(5.5). 


5.0  Finite- Volume  Solvers  for  the  Euler 
Equations 

In  this  section,  we  consider  the  extension  of 
upwind  scalar  advection  schemes  to  the  Euler  equa¬ 
tions  of  gasdynamics.  As  we  will  see.  the  changes 
are  relatively  minor  since  most  of  the  difficult, 
work  has  already  been  done  in  designing  the  scalar 
scheme. 

5.1  Euler  Equations  in  Integral  Form 

The  physical  laws  concerning  the  conserva¬ 
tion  of  mass,  momentum,  and  energy  for  an  ar¬ 
bitrary  region  ft  can  be  written  in  the  following 
integral  form: 

Conservation  of  Mass 

ir  f  fida  if  p(V  -n)  dl  —  0  (5.1) 

01  Jn  Jon 

Conservation  of  Momentum 

f  pV  da+  /  />V(V-n )dl+  f  pudl  —  Q 
0t  Jq  Jo n  Jon 

(5.2) 

Conservation  of  Energy 

~  f  E  da  +  /  (E  +  p)(V  •  n)  dl  =  0  (5.3) 
vt  Jn  Jon 

In  these  equations  p,  V,  j>,  and  E  are  the  density, 
velocity,  pressure,  and  total  energy  of  the  fluid. 
The  system  is  closed  by  introducing  a  thermody¬ 
namical  equation  of  state  for  a  perfect  gas: 

p=(T-l)(£?-^(V.V))  (5.4) 

These  equations  can  be  written  in  a  more  com¬ 
pact  vector  equation: 


5.2  Extension  of  Scalar  Advection  Schemes  to 
Systems  of  Equations 

The  extension  of  the  scalar  advection  schemes 
to  the  Euler  equations  requires  two  rather  minor 
modifications: 

(1)  Vector  Flux  Function.  The  scalar  flux  func¬ 
tion  is  replaced  by  a  vector  flux  function.  In  the 
present  work,  the  mean  value  linearization  due  to 
Roe  [47]  is  used.  The  form  of  this  vector  flux  func¬ 
tion  is  identical  to  the  scalar  flux  function  (4.53), 
i.o. 

h(u+,u~,n)  (f(u+,n)  +  f(u~,n)) 

1  (5-6) 

~oM(»)l(u+  -  U-) 

£ 

where  f(u,n)  ~  F(u)  n,  and  A  —  df/dn  is  the 
fhix  Jacobian. 

(2)  Componentwise,  limiting.  The  solution  vari¬ 
ables  are  reconstructed  componentwise.  In  prin¬ 
ciple,  any  set  of  variables  can  be  used  in  the  recon¬ 
struction  (primitive  variables,  entropy  variables, 
etc.).  Note  that  conservation  of  the  mean  can 
make  certain  valuable  combinations  more  difficult 
to  implement  than  others  because  of  the  nonlin- 
earities  that  may  be  introduced.  The  simplest 
choice  is  obviously  the  conserved  variables  them¬ 
selves.  When  conservation  of  the  mean  is  not  im¬ 
portant  (steady-state  calculations),  we  prefer  the 
use  of  primitive  variables  in  the  reconstruction 
step. 

The  resulting  scheme  for  the  Euler  equations 
has  the  same  shock  resolving  characteristics  as 
the  scalar  scheme.  Figures  5.1a-b  show  a  simple 
Steiner  triangulatiou  and  the  resulting  solution 
obtained  with  a  linear  reconstruction  scheme  for 
transonic  Euler  flow  ( Moo  —  80,  o:  =  1.25°)  over 
a  NACA  0012  airfoil  section. 
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Figure  5.1a  Initial  triangulation  of  airfoil,  3155 
vertices. 

Even  though  the  grid  is  very  coarse  with  only 
3155  vertices,  the  upper  surface  shock  is  captured 
cleanly  with  a  profile  that  extends  over  two  cells 
of  the  mesh. 


Figure  5.1b  Mach  number  contours  on  initial 
triangulation,  Moo  —  80,  <*=  1.25®. 

Clearly,  the  power  of  the  unstructured  grid  method 
is  the  ability  to  locally  adapt  the  mesh  to  resolve 
llow  features.  Figures  5.2a-b  show  an  adaptively 
refined  mesh  and  solution  for  the  same  llow.  1  he 
mesh  has  been  locally  refined  based  on  a  poste¬ 
riori  error  estimates.  These  estimates  were  ob¬ 
tained  by  performing  fc-exact  reconstruction  in 
each  control  volume  using  linear  and  quadratic 
functions.  A  complete  discussion  of  error  esti¬ 


mation  and  solution  adaption  will  be  given  by 
Professor  Johnson  in  these  notes.  The  paper  by 
Warren  et  al  [29]  also  provides  some  interesting 
insights  into  the  area  of  mesh  adaptation  for  Hows 
containing  discontinuities. 


Figure  5.2a  Solution  adaptive  triangulation  of 
airfoil,  6917  vertices. 

The  How  features  in  figure  5.2b  are  clearly  defined 
with  a  weak  lower  surface  shock  now  visible.  Fig¬ 
ure  5.3  shows  the  surface  pressure  coefficient  dis¬ 
tribution  on  the  airfoil.  The  discontinuities  are 
crisply  captured  by  the  scheme. 


Figure  5.2b  Mach  number  contours  on  adapted 
airfoil. 

The  other  major  advantage  of  unstructured  grids 
is  the  ability  to  automatically  mesh  complex  ge¬ 
ometries.  The  next  example  shown  in  figure  5.4a- 


I)  is  a  St oiuor  trinngulatiou  and  solution  about,  a 
multi-component  airfoil. 

-1.5  |  - , - 1 
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Figaro  5.3  Comparison  of  C,,  distributions  on 
initial  and  adapted  meshes. 

Using  t  lu*  incremental  Steiner  algorit  hm  discussed 
previously,  the  grid  ran  constructed  from  curve 
tint  a  in  about  ten  minutes  time  on  a  standard  en¬ 
gineering  workstation  using  less  than  a  minute  of 
actual  CPU  time. 


Figaro  5.4a  Steiner  Grid  about  multi-component 
airfoil. 

The  flow  calculation  shown  in  figure  5.1b  was  per¬ 
formed  on  a  CRAY  supercomputer  taking  just  a 
few  minutes  of  CPU  time  using  a  linear  recon¬ 
struction  scheme  with  implicit  time  advancement. 
Details  of  the  implicit  scheme  are  given  in  the 
next,  section. 


Figure  5.4b  Mach  number  contours  about  multi¬ 
component  airfoil,  Mr, c  =  .2, n  =  0°. 

We  previously  mentioned  the  importance  of 
using  aeeurate  flux  quadrature  formulas.  In  fact, 
for  A’-exact  reconstruction,  we  suggest.  A7  point 
Gauss  quadratures  with  N  >  {k  -1-  l)/2.  In  Figs, 
5.5a-b  this  importance  is  il’ustratcd  by  plotting 
density  contours  for  a  numerical  calculation  of  the 
Ringleb  flow  (previously  described)  using  quadratic 
reconstruction  k  =  2.  Our  formula  suggests  that 
t  wo  point  quadratures  should  be  used  in  this  ease. 


Figure  5.5a  Ringleb  flow  density  contours  us¬ 
ing  quadratic  reconstruction  and  one-point  Gauss 
quadrature  (k  =  2,  N  —  1). 

Figure  5.5a  shows  contours  for  a  calculation  us¬ 
ing  one-point.  Gauss  quadrature  and  Fig.  5.5b 
shows  contours  for  a  calculation  using  two-point 
quadratures.  The  improvement  in  Fig.  5.5b  is 


dramatic.  Increasing  the  number  of  quadrature 
points  to  throe  leaves  the  solution  unchanged. 


Figure  5.5b  Ringleb  How  density  contours  us¬ 
ing  quadratic  reconstruction  and  two-point  Gauss 
quadrature  (A-  =  '2,  ;V  as  2). 

The  algorithms  outlined  in  section  4  have 
l>ee)>  extruded  to  include  the  Eider  equations  in 
three  dimensions.  In  reference  (43],  we  showed  the 
natural  extension  of  the  edge  data  structure  in  the 
development  of  an  Fader  equation  solver  on  tetra¬ 
hedral  meshes.  One  of  the  calculations  presented 
in  this  paper  simulated  Euler  How  about  the  ON- 
ERA  MO  wing.  The  tetrahedral  mesh  used  for 
the  calculations  was  a  subdivided  151x17x33  hex- 
slit  dial  C-type  mesh  with  spherical  wing  (ip cap. 
The  resulting  tetrahedral  mesh  contained  4%, 350 
Ictraliedrn,  105,177  vertices,  1 1.690  boundary  ver- 
t  ices,  and  23,376  boundary  faces,  Figure  5.6  shows 
a  closeup  of  the  surface  mesh  near  the  outboard 
lip. 


Figure  5.6  Closeup  of  MG  Wing  Surface  Mesh 
Neav  Tip. 

TVansonic  calculations,  A/of,  =  .84, a  3.06°, 
were  performed  on  the  CRAY  YMP  computer 
using  the  upwind  code  with  both  the  Green-Gauss 


and  L-t  gradient  iveonstnu  lion.  Figure  5.7  shows 
surface  pressure  contours  on  the  wing  surface  atal 
Cp  profiles  at  several  spaa  stations. 


Figure  5.7  M6  Wing  Surface  Pressure  Contours 
and  Span  wise  Cv  Profiles  (A/oo=  .84,  a  =  3.06°). 

Pressure  contours  clearly  show  the  lambda 
typo  shock  pattern  on  the  wing  surface.  Figures 
5.8a-c  compare  pressure  coefficient  distributions 
at  three  span  stations  on  the  wing  measured  iu 
the  experiment,  y/h=,44,.65.,.95. 


Figure  5.8a  M6  Wing  Spamvise  Pressure  Distri¬ 
bution,  xj/b  —  .44. 

Each  graph  compares  the  upwind  code  with  Green- 
Gauss  and  L<i  gradient  calculation  with  the  CFL3D 
results  appearing  in  [48]  ami  the  experimental 
data  [49j.  Numerical  results  on  the  tetrahedral 
mesh  compare  very  favorably  with  the  CFL3D 
structured  mesh  code.  The  results  for  the  out¬ 
board  station  appear  better  for  the  present  code 
than  the  CFL3I)  results.  This  is  largely  due  to 
the  difference  in  grid  topology  and  subsequent,  im¬ 
proved  resolution  in  that  area. 


(.-54 


Figure  5.8b  M6  Wing  Spanwise  Pressure  Distri¬ 
bution  y/b  =  .65. 


Figure  5.8c  M6  Wing  Spanwise  Pressure  Distri¬ 
bution  y/b  =  .95. 

5.3  Implicit  Linearizations 

In  this  section,  we  consider  the  task  of  lin¬ 
earizing  the  discrete  spatial  operator  for  purposes 
of  backward  Euler  time  integration.  Defining  the 
solution  vector  u  =  [ui,«2,U3,  ...,«jv]r,  the  basic 
scheme  r  written  as 

Dut  =  R(u)  (5.7) 

where  D  is  a  positive  diagonal  matrix.  Perform¬ 
ing  a  backward  Euler  time  integration,  equation 
(5.7)  is  rewritten  as 

Z?(uH+1  -  u")  =  At  R(un+1).  (5.8) 

where  n  denotes  the  iteration  (time  step)  level. 
Linearizing  the  right-hand-side  (RHS)  of  (5.8)  in 
time  produces  the  following  form: 

Z5(un+1—  u")  =  At  (R(un)  +  ^~(u,l+l  -  u’*)^ 


By  rearranging  terms,  we  can  arrive  at  the  so 
called  “delta”  form  of  the  backward  Euler  scheme 

D  -  At  — -1  (u’,+1  -  u")  =  +  At  R(u”) 

1  * U  '  (5-9) 

Note  that  for  large  time  steps,  the  scheme  be¬ 
comes  equivalent  to  Newton’s  method  for  finding 
roots  of  a  nonlinear  system  of  equations.  New¬ 
ton’s  method  is  known  to  be  quadratically  con¬ 
vergent  for  isolated  roots.  Each  iteration  of  the 
scheme  requires  the  solution  of  an  algebraic  sys¬ 
tem  of  linear  equation.  In  practice,  we  use  ei¬ 
ther  sparse  direct  methods  as  discussed  in  ref. 
[45]  or  preconditioned  minimum  residual  meth¬ 
ods.  Both  of  these  topics  will  be  addressed  by 
Professor  Hughes  and  other  lecturers.  The  suc¬ 
cess  or  failure  of  these  methods  hinges  heavily  on 
the  accuracy  of  the  time  linearization.  For  the 
schemes  discussed  in  sections  4  and  5,  the  most 
difficult  task  is  the  linearization  of  the  numerical 
flux  vector  with  respect  to  the  two  solution  states. 
For  example,  given  the  flux  vector 

h(u4,u",n)  (f(u+, n)  +  f(u~, n)) 

2  (5.10) 

-i|A(u)|(u+-u-) 

we  require  the  Jacobian  terms  j—f-  and  ~r.  In 
reference  [50],  we  derived  the  exact  Jacobian  lin¬ 
earization  of  Roe’s  flux  function.  In  this  same  pa¬ 
per,  approximate  linearizations  of  (5.10)  were  in¬ 
vestigated.  The  linearization  of  (5.10)  is  straight¬ 
forward,  except  for  terms  which  arise  from  differ¬ 
entiation  of  |A(u)|.  A  simple  approximation  is 
to  neglect  these  terms  in  the  linearization  pro¬ 
cess,  This  produces  the  following  approximate 
linearizations: 

=  ^(«+)  -  |i4(u)|)  (Approx  1) 

+  M(u)|)  (Approx  1) 

It  is  not  difficult  that  to  prove  that  the  error  asso¬ 
ciated  with  this  approximation  is  0(||u+  —  u~|[) 
which  makes  the  linearization  attractive  for  the 
implicit  calculation  of  smooth  flows.  Near  dis¬ 
continuities,  this  term  becomes  0(1)  which  can 
slow  the  convergence  of  the  scheme  considerably. 
One  important  attribute  of  this  approximation  is 
that  it  retains  time  conservation  of  the  scheme. 
This  amounts  to  a  telescoping  property  of  fluxes 
in  time.  For  time  accurate  problems  involving 
moving  discontinuities,  this  property  is  essential 
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to  obtain  correct  shock  speeds.  Another  approx* 
imate  form  considered  in  [50]  uses  the  following 
simple  approximation 

=  A(u)“  (Approx  2) 

^  A( u)4  (Approx  2) 

This  linearization  also  differs  from  the  exact  lin¬ 
earization  by  terms  O(|]o+  -  u_||).  One  impor¬ 
tant  feature  of  this  linearization  is  that  it  pro¬ 
duces  a  LHS  operator  for  the  first  order  upwind 
scheme  which  is  (block)  diagonally  dominant.  For 
those  solution  methods  or  preconditioning  meth¬ 
ods  based  on  classical  relaxation  schemes,  this 
property  establishes  convergence  of  the  relaxation 
method.  Scalar  equation  analysis  also  indicates 
that  when  this  linearization  is  used  with  back¬ 
ward  Euler  time  integration  and  first  order  up¬ 
wind  space  discretization,  the  resulting  scheme  is 
monotone  for  all  time  step  size.  Unfortunately, 
this  linearization  violates  time  conservation  and 
should  not  be  used  for  time  accurate  calculations. 


Figure  5.9  Convergence  histories  for  exact  and 
approximate  linearizations.  Solid  lines  show  con¬ 
vergence  histories  for  calculations  carried  out  us¬ 
ing  first  order  upwind  RHS  in(5.9).  Dashed  line 
depicts  scheme  run  with  exact  linearization  of 
first  order  scheme  on  the  LHS  and  second  order 
RHS  discretization. 

Using  the  edge  data  structure,  the  assembly  of 
the  LHS  matrix  in  (5.9)  for  the  first  order  scheme 
is  very  straightforward.  The  flux  associated  with 
each  edge  e(vi,Vj)  of  the  mesh  is  linearized  with 
respect  its  two  arguments,  u,  and  Uj.  This  means 
that  the  linearization  contributes  to  the  forma¬ 
tion  of  the  block  matrix  elements  in  the  i-th  row 
j-th  column,  i-th  row  i-th  column,  j-th  row  i- 
th  column,  and  j-th  row  j-th  column  positions. 


This  leads  to  a  highly  vectorizable  (using  gather- 
scatter  hardware)  algorithm  for  matrix  assembly 
(and  matrix  multiplies).  For  the  higher  order 
reconstruction  schemes,  the  usual  strategy  is  to 
only  construct  LHS  matrix  terms  associat'd  with 
the  first  order  upwind  scheme  while  using  a  higher 
order  RHS  operator.  The  mismatch  of  operators 
destroys  any  hope  of  quadratic  convergence  for 
large  time  steps.  Figure  5.9  graphs  the  conver¬ 
gence  history  for  a  typical  calculation  using  the 
linearizations  discussed  above.  The  flow  prob¬ 
lem  being  solved  is  subsonic  flow  over  a  single 
airfoil.  In  this  case,  the  flow  is  smooth  and  all 
linearizations  should  be  applicable.  In  this  fig¬ 
ure,  wc  see  that  when  the  RHS  and  LHS  op¬ 
erators  both  correspond  to  the  first  order  up¬ 
wind  scheme  and  the  exact  Jacobian  linearization 
is  used,  quadratic  convergence  is  achieved.  The 
schemes  using  approximate  linearizations  do  not 
approach  quadratic  convergence  but.  are  very  ef¬ 
fective  in  reducing  the  initial  residual.  In  reality, 
most  computations  are  terminated  after  reduc¬ 
ing  the  residual  about  four  orders  of  magnitude. 
For  the  present  example,  this  would  amount  to  7 
steps  using  the  exact  linearization  or  8-9  steps  us¬ 
ing  the  approximate  forms.  Using  a  higher  order 
accurate  RHS  slows  the  convergence  even  further. 
Nevertheless,  a  four  order  reduction  in  residual  is 
achieved  after  30-40  steps. 

6.0  Numerical  Solution  of  the  Navier-Stokes 
Equations  with  Turbulence 

61  Turbulence  Modeling  for  Unstructured  Grids 

Simulating  the  effects  of  turbulence  on  un¬ 
structured  meshes  via  the  compressible  Reynolds- 
averaged  Navier-Stokes  equations  and  turbulence 
modeling  is  a  relatively  unexplored  topic.  In  early 
work  by  Rostand  [51],  an  algebraic  turbulence 
model  was  incorporated  into  an  unstructured  mesh 
flow  solver.  This  basic  methodology  was  later  re¬ 
fined  by  Mavriplis  [52]  for  the  flow  about  multi¬ 
element  airfoil  configurations.  Both  of  these  im¬ 
plementations  utilize  locally  structured  meshes 
to  produce  one-dimensional-like  boundary-layer 
profiles  from  which  algebraic  models  can  deter¬ 
mine  an  eddy  viscosity  coefficient  for  use  in  the 
Reynolds-averaged  Navier-Stokes  equations.  The 
use  of  local  structured  meshes  limits  the  general 
applicability  of  the  method. 

The  next  level  of  turbulence  modeling  in¬ 
volves  the  solution  of  one  or  more  auxiliary  differ¬ 
ential  equations.  Ideally  these  differential  equa¬ 
tions  would  only  require  initial  data  and  bound¬ 
ary  conditions  in  the  same  fashion  as  the  Reynolds- 
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averaged  mean  equations.  The  use  of  turbulence 
models  based  on  differential  equations  greatly  in¬ 
creases  the  class  of  geometries  that  can  be  treated 
“automatically.”  Unfortunately  this  does  not  make 
the  issue  of  turbulence  modeling  a  “solved”  prob¬ 
lem  since  most  turbulence  models  do  not  perform 
well  across  the  broad  range  of  flow  regimes  usu¬ 
ally  generated  by  complex  geometries.  Also  keep 
in  mind  that  most  turbulence  models  for  wall- 
bounded  flow  require  knowledge  of  distance  to 
the  wall  for  use  in  “damping  functions”  which 
simulate  the  damping  effect  of  solid  walls  on  tur¬ 
bulence.  The  distance  required  in  these  models  is 
measured  in  “wall  units”  which  means  that  phys¬ 
ical  distance  from  the  wall  y  is  sealed  by  the  local 
wall  shear,  density,  and  viscosity. 


Twall  V 
Pwal I  V 


(6.1) 


Scaling  by  wall  quantities  is  yet  another  compli¬ 
cation  but  does  not  create  serious  implementa¬ 
tion  difficulties  for  unstructured  meshes  as  we  will 
demonstrate  shortly. 

6.2  A  One-Equation  Turbulence  Transport  Model 

In  a  recent  report  with  Baldwin  [53],  we  pro¬ 
posed  and  tested  (on  structured  meshes)  a  single 
equation  turbulence  transport  model.  In  this  re¬ 
port,  the  model  was  tested  on  various  subsouic 
and  transonic  flow  problems:  flat  plates,  airfoils, 
wakes,  etc.  The  model  consists  of  a  single  scalar 
advection-diffusiou  equation  with  source  term  for 
a  field  var  iable  which  is  the  product  of  turbulence 
Reynolds  number  anti  kinematic  viscosity,  vRr- 
This  variable  is  proportional  to  the  eddy  viscosity 
except  very  near  a  solid  wall. 


=  (C<2h  ~  Ct>)\A^/’P 

+(>'  4  — )V2(i vhr)  -  — (W,)  •  V(vRr) 

Or 

(6.2) 

In  this  equation.  P  is  the  production  of  turbulent 
kinetic  energy  and  is  related  to  the  mean  flow  ve¬ 
locity  ratc-of-strain  and  the  kinematic  eddy  vis¬ 
cosity  V(.  In  equation  (6.2),  the  following  func¬ 
tions  are  required: 


—  =(ctJ  - 

OR  -<Jt 

vt  =clt(vilr)DiD2 
fit  =pvt 

Di  =1  -  exp (~y+/A+) 
Di  =1  -  exp(-y+  /A%) 


(dUk  V 

\dx'k) 


P_  (dUiWj\dVi  2 
p  Vt\dxj  + dxj  dxj 

h(y+)  =—L  -f  (l  -  +  dm  ( vd^D2 

CC2  Ay1  l 

+  -J+ exp(-»+MP  Z>,)J 


The  following  constants  have  been  recommended 
in  [53]: 


k  =0.41,  ct,  =  1.2,  c£j  =  2.0 
c„  =0.09,  A+  =  26,  Af  =  10 


We  also  recommend  the  following  boundary  con¬ 
ditions  for  (6.2): 

1.  Solid  Walls:  Specify  Rr  =  0. 

2.  Inflow  (V  •  n  <  0):  Specify  Rj  =  (Rt)<x>  <  1- 

3.  Outflow  (V  -  n  >  0):  Extrapolate  Rr  from 
interior  values. 


Equation  (6.2)  depends  on  distance  to  solid 
walls  in  two  ways.  First,  the  damping  function 
f‘i  appearing  in  equation  (6.2)  depends  directly 
on  distance  to  the  wall  (in  wall  units).  Secondly, 
vt  depends  on  vRt  and  damping  functions  which 
require  distance  to  the  wall. 

vt  =  c^D1{y+)D2(y+)vRt 

It  is  important  to  realize  that  the  damping  func¬ 
tions  /2,Di,  and  D2  deviate  from  unity  only  when 
very  near  a  solid  wall.  For  a  typical  turbulent 
boundary-layer  (see  figure  6.1)  accurate  distance 
to  the  wall  is  only  required  for  mesh  points  which 
fall  below  the  logarithmic  region  of  the  boundary- 
layer. 


6-57 


Figure  6.1  Typical  flat  plate  boundary-layer  from 
ref.  [53]  showing  dependence  of  turbulence  model 
on  distance  to  wall. 

The  relative  insensitivity  of  distance  to  the  wall 
means  that  accurate  estimation  of  distance  to  the 
wall  is  only  required  for  a  small  number  of  points 
that  are  extremely  close  to  a  boundary  surface. 
The  remaining  points  can  be  assigned  any  approx¬ 
imate  value  of  physical  distance  since  the  damp¬ 
ing  functions  arc  essentially  at  their  asymptotic 
values.  A  general  procedure  for  calculation  of  dis¬ 
tance  to  the  wall  in  wall  units  is  to  precompute 
and  store,  for  each  vertex  of  the  mesh,  the  mini¬ 
mum  distance  from  the  vertex  to  the  closest  solid 
wall  (examples  are  shown  later  in  figures  6.2b  and 
G.3b).  This  strategy  can  only  fail  if  two  bodies  are 
in  such  close  proximity  that  the  near  wall  damp¬ 
ing  functions  never  reach  their  asymptotic  values. 
Realistically  speaking,  the  validity  of  most  tur¬ 
bulence  inodeis  wouid  be  in  serious  question  in 
this  situation  anyway.  In  general,  the  minimum 
distance  from  vertex  to  boundary  edge  does  not 
occur  at  the  end  points  of  a  boundary  edge  but 
rather  interior  to  a  boundary  edge.  For  each  ver¬ 
tex,  information  concerning  which  boundary  edge 
achieves  the  minimum  distance  and  the  weight 
factor  for  linear  interpolation  along  the  boundary 
edge  must  be  stored.  Data  can  then  be  interpo¬ 
lated  to  the  point  of  minimum  distance  whenever 
needed.  In  the  course  of  solving  (6.2),  distance  to 
a  solid  wall  in  wall  units  is  calculated  by  retriev¬ 
ing  physical  distance  to  the  wall  and  the  local  wall 
quantities  needed  for  (6.1)  as  interpolated  along 
the  closest  boundary  edge. 

The  numerical  calculations  presented  in  this 
section  represent  a  successful  application  of  the 
ideas  discussed  in  previous  sections.  Figures  6.2a 
and  6.3a  show  examples  of  Min-Max  triangula¬ 
tions  about  single  and  multi-element  airfoils.  The 
first  geometry  consists  of  a  single  RAE  2822  air¬ 


foil.  Navier-Stokes  flow  is  computed  about  this 
geometry  assuming  turbulent  flow  with  the  fol¬ 
lowing  free-stream  conditions:  Moo  =  .725,  a  — 
2  31°,i?e  =  6.5  million.  Wind  tunnel  experi¬ 
ments  for  the  RAE  2822  geometry  at  these  test 
conditions  have  been  performed  by  Cook,  Mc¬ 
Donald,  and  Firmin  [54].  The  RAE  2822  airfoil 
mesh  shown  in  figure  6.2a  contains  approximately 
14000  vertices  and  41000  edges.  The  second  ge¬ 
ometry  consists  of  a  two  element  airfoil  configu¬ 
ration  with  wind  tunnel  walls.  The  inflow  condi¬ 
tions  assume  turbulent  flow  with  Moo  =  -09  and 
Re  =  1.8  million.  Details  of  the  geometry  and 
wind  tunnel  test  results  can  be  found  in  the  re¬ 
port  by  Adair  and  Horne  [55].  The  two  element, 
mesh  shown  in  figure  6.3a  contains  approximately 
18000  vertices  and  55000  edges. 

Both  meshes  were  constructed  in  two  steps. 
The  first  step  was  to  generate  a  Delaunay  trian¬ 
gulation  of  the  point  cloud.  As  mentioned  earlier, 
the  method  of  Delaunay  triangulation  can  gener¬ 
ate  poor  triangulations  for  highly  stretched  point 
distributions.  Both  meshes  suffered  from  nearly 
collapsed  triangles  with  two  small  interior  angles. 
As  a  second  step,  a  Min-Max  triangulation  was 
constructed  by  edge  swapping  the  Delaunay  tri¬ 
angulation.  Edge  swapping  repaired  both  trian¬ 
gulations.  Both  airfoil  geometries  were  calculated 
assuming  turbulent  flow  using  the  one-equation 
turbulence  model  (6.2).  Level  sets  of  the  gen¬ 
eralized  distance  function  used  in  the  turbulence 
model  are  shown  in  figures  6.2b  and  6.3b. 


Figure  6.2a  Mesh  near  RAE  2822  airfoil. 
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airfoil. 


Figure  6,2b  Contours  of  distance  function  for 
turbulence  model. 


Figure  6.2c  Closnup  of  Mach  number  contours 
near  airfoil. 

Figures  6.2c-d  plot  Mach  number  contours 
and  pressure  coefficient  distributions  for  the  RAE 
2822  airfoil.  The  pressure  coefficient  distribution 
compares  favorably  with  the  experiment  of  Cook, 
McDonald,  and  Firmin  (29).  Leading  edge  trip 
strips  were  used  on  the  experimental  model  but 
not  simulated  in  the  computations.  This  may  ex¬ 
plain  the  minor  differences  in  the  leading  edge 
pressure  distribution. 


Navier-Stokes  computations  for  the  two  el¬ 
ement  airfoil  configuration  as  shown  in  figures 
6.3c-d.  The  effects  of  wind  tunnel  walls  have  been 
modeled  in  the  computation  by  assuming  an  in- 
viscid  wall  boundary  condition.  Mach  number 
contours  are  shown  in  figure  6.3c.  Observe  that 
the  contours  appear  very  smooth,  even  in  regions 
where  the  mesh  becomes  very  irregular.  This  is 
due  to  the  insistance  that  linear  functions  be  ac¬ 
curately  treated  in  the  flow  solver  reguardless  of 
mesh  irregularities. 


Figure  6.3a  Mesh  near  Multi-element  airfoil. 


6-5M 


Figure  6.3b  Contours  of  distance  function  for 
turbulence  model. 


Figure  6.3c  Closcup  of  Mach  number  contours 
near  airfoil. 

Pressure  coefficient  distribution  on  the  main 
airfoil  and  flap  are  graphed  in  figure  6.3d.  Com¬ 
parison  of  calculation  and  experiment  on  the  main 
element  is  very  good.  The  suction  peak  values  of 
pressure  coefficient  on  the  flap  element  are  slightly 
below  the  experiment.  The  experimentors  also 
note  a  small  separation  bubble  at  the  trailing  edge 
of  the  flap  which  was  not  found  in  the  computa¬ 
tions. 
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1  SUMMARY 

This  contribution  presents  high  resolution  upwind  finite 
volume  schemes  based  on  a  polynomial  reconstruction 
of  the  unknowns  on  unstructured  polygonal  cells.  The 
schemes  are  Essentially  Non-Oscillatory  (ENO)  through 
the  use  of  an  adaptive  stencil  selection.  A  complete  anal¬ 
ysis  and  comparison  given  in  this  text  shows  that  the  or¬ 
der  of  the  space  accuracy  of  the  schemes  is  at  least  equal 
to  the  degree  of  the  reconstruction  polynomial.  Numeri¬ 
cal  results  arc  shown  for  a  nonlinear  hyperbolic  conserva¬ 
tion  equation,  confirming  the  ENO  shock  capturing  and 
higher  order  accuracy  on  highly  irregular  grids.  More  re¬ 
alistic  Euler  calculations  will  demonstrate  the  ability  of 
the  concepts  that  arc  outlined  theoretically. 

2  INTRODUCTION 


<  * >n  +  h  /r  Piidr  =<  S{q'  ^ >n  (3) 

in  which  n  is  the  outward  pointing  unit  normal  at  the 
boundary  I'  of  0  and  <  .  >n  is  an  averaging  operator 
over  the  domain  ft.  The  flux  P.ii  will  be  denoted  by  //. 
In  the  case  of  the  Euler  equations,  q,  II  and  S  are  given 
as  follows: 
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The  aim  of  the  present  work  is  to  investigate  adaptive 
unstructured  grid  Finite  Volume  methods  for  the  com¬ 
putation  of  2D  viscous  compressible  flows.  A  general  ap¬ 
proach  is  followed,  based  on  unstructured  polygonal  finite 
volumes  with  an  arbitrary  number  of  edges  and  with  a 
cell  centered  definition  of  the  unknowns,  ref.  [1,  2].  This 
includes  as  a  subset  the  classical  structured  grids  based 
on  quadrilaterals  as  well  as  the  unstructured  grids  based 
on  triangles.  All  types  of  meshes  can  be  encapsulated  in 
just  one-  and  the  same  datastructurc  which  was  cast  into 
an  object-oriented  database  written  in  C-F-4-  in  view  of 
higher  order  schemes.  This  text,  however,  will  not  treat 
convection-diffusion  problems  but  only  purely  hyperbolic 
problems  of  the  following  form: 

ff  +  $.P(q,?)  =  S(q,r)  (1) 

in  which  q  is  a  set  of  solution  variables,  f  is  any  location  in 
flic  domain  Q  where  the  equation  is  to  be  solved  and  S  is 
a  source  term.  The  vector  P  =  (F,  G)  contains  the  fluxes 
of  the  convection  quantity  in  the  x-  and  y-dircction. 

Eq.  (1)  can  be  cast  in  integral  form  using  Green’s  theorem 
as: 

i  Jh  q.dQ  -f  jrP(q,f).n.dT  =  J ^S(9,f).dn  (2) 
or,  after  division  by  0: 
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with  «„  =  u.n  the  velocity  in  the  normal  direction, 
n  =  («,,»„),  p  the  density,  u  and  v  the  velocity  com¬ 
ponents,  e  the  total  energy  per  unit  mass,  p  the  static 
pressure.  The  next  relations  exist  between  e,  p  and  the 
temperature  T: 

p  =  p.(y  —  1)  ,  (e  - 

P  =  p.R.T 

c*  =  y.R.T  (5) 

with  R  the  universal  constant  for  gases,  y  the  ratio  of  the 
specific  heats  at  constant  pressure  respectively  constant 
temperature  and  c  the  speed  of  sound. 

In  the  next  section,  a  description  is  given  of  a  more  classic 
compressible  Euler  solver  for  adaptive  unstructured  grids. 
These  solvers  are  based  on  either  a  constant  or  a  linear 
representation  of  the  solution  in  each  volume,  see  [3]  and 
therefore,  they  can  only  achieve  a  space  accuracy  which 
is  in  between  order  one  and  order  two.  The  solution  rep¬ 
resentation  is  discontinuous  across  cell  interfaces  and  the 
values  on  both  sides  of  each  cell  interface  are  used  as  the 
initial  values  for  a  local  Riemann  problem.  In  the  case  of 
linear  solution  representations,  cither  limiters  or  artificial 
dissipation  are  introduced  in  order  to  maintain  mono¬ 
tonicity  in  the  neighbourhood  of  discontinuities.  How¬ 
ever,  these  techniques  locally  reduce  the  space  accuracy 
of  the  scheme  to  first  order.  These  schemes  arc  said  to  be 
Total  Variation  Diminishing  (TVD),  see  ref.  [4,  5,  6].  Ex¬ 
tending  these  solvers  towards  Navier-Stokes  calculations 
reveals  a  strong  dependency  of  the  obtained  solutions  on 
the  local  mesh  quality.  This  is  particularly  true  when 


local  grid  refinements  have  been  applied  in  order  to  have 
a  better  spatial  resolution. 

One  of  the  possibilities  to  reduce  the  mesh  sensitivity  is 
to  increase  the  spatial  accuracy  of  the  schemes  used.  To 
achieve  this,  the  solution  representation  (reconstruction) 
in  each  finite  volume  must  be  more  accurate.  Barth  [7,  8) 
proposed  in  this  context  to  reconstruct  the  solution  in 
each  control  volume  by  a  polynomial  of  higher  degree. 
Based  on  his  ideas,  two  variants  of  his  higher  order  poly¬ 
nomial  reconstruction  algorithm  wiii  be  presented,  Some 
results  produced  by  these  algorithms  and  a  numerical  er¬ 
ror  study  will  be  shown. 

A  theoretical  study  concerning  the  use  of  these  two  algo¬ 
rithms  in  an  upwind  finite  volume  solver  is  carried  out  in 
a  separate  section.  This  study  reveals  that  schemes  us¬ 
ing  polynomial  solution  reconstruction  are  higher  order 
accurate  in  space  regardless  the  local  grid  quality.  This  is 
crucial  for  solution  adaptive  grid  procedures  as  they  in¬ 
troduce  sudden  and  considerable  changes  in  the  cell  size 
and  shape. 

Note  that  other  approaches  exist  to  reduce  the  grid  sen¬ 
sitivity  in  the  literature.  Essers  and  Ilenard  (9]  for  in¬ 
stance,  apply  a  non-conservative  correction  to  the  flux 
based  on  the  numerically  computed  coefficients  in  the 
local  truncation  error.  However,  as  conservation  is  no 
longer  guaranteed,  discontinuities  will  not  exhibit  the  cor¬ 
rect  jump. 

Struijs  and  Dcconinck  [10,  11]  presented  their  so  called 
fluctuation  splitting  schemes  whereby  the  total  net  flux  in 
a  triangle  is  sent  to  its  3  nodes  according  to  the  direction 
of  the  convection  speeds.  Several  wave  models  were  set  up 
to  find  these  directions  and  showed  to  be  very  performing 
in  the  case  of  contact  discontinuities  on  irregular  meshes. 

It  is  also  well  known  that  schemes  with  a  non-constant  so¬ 
lution  reconstruction  arc  not  monotone  and  therefore  net 
stable.  In  order  to  stabilize  these  higher  order  schemes, 
nonlincaritics  have  to  be  introduced.  For  the  classical 
TVI)  schemes,  this  is  achieved  by  limiting  the  solution 
reconstruction,  ref.  [3,  6,  12].  In  the  present  work,  a  dif¬ 
ferent  path  is  followed.  The  support  of  the  reconstruc¬ 
tion,  i.e.  the  set  of  cells  used  to  compute  the  coefficients 
of  the  reconstruction  polynomial,  is  arbitrary;  this  degree 
of  rccdom  leaves  the  room  to  choose  the  support  in  an 
adaptive  way  to  stabilize  the  scheme.  The  nonlinearity 
lies  then  in  the  tact  that  cells  are  accepted  or  rejecter! 
from  the  support  after  each  iteration.  Harten  et  al.  [13] 
proposed  an  adaptive  selection  of  the  stencil  which  allows 
to  define  the  polynomial  in  each  control  volume  such  that 
the  derivatives  of  the  reconstruction  polynomial  are  min¬ 
imal.  Their  ID  analysis  showed  that  this  selection  leads 
to  the  so  called  ENO-schemcs,  i.e.  schemes  in  which  the 
occurring  oscillations  are  only  of  order  k,  the  degree  of 
the  reconstruction  polynomial. 

Shu  and  Oshei  [14]  extended  this  method  to  two  dimen¬ 
sional  cartesian  meshes  using  a  dimension  by  dimension 
reconstruction,  while  Durlofsky  et  al.  [15]  used  linear  re¬ 
construction  with  adaptive  stencils  on  triangular  meshes. 
In  the  present  contribution,  two  generalizations  of  these 
support  selection  algorithms  towards  unstructured  grids 
consisting  of  polygonal  cells  will  be  discussed  in  more 
detail,  A  first  one  is  based  on  some  marching  proce¬ 
dure  whereby  the  marching  direction  is  based  on  some 
heuristic  criteria  such  that  both  the  first  order  gradient 
of  the  reconstruction  and  the  variation  of  that  first  gra¬ 
dient  over  the  whole  of  the  stencil  are  minimal.  A  second 
selection  algorithm  suggests  a  finite  number  of  candidate 
supports  and  selects  then  the  one  with  minimal  norm  of 
all  derivatives  up  to  an  order  equal  to  the  order  of  the 
reconstruction.  Other  methods  are  given  in  the  litera¬ 


ture  by  Harten  and  C'hakravarlhy  [16]  which  minimize 
mathematically  all  derivatives  up  to  order  k. 

In  a  final  section,  numerical  results  obtained  using 
schemes  with  ENO-rcconstrnction  will  be  presented  for 
nonlinear  scalar  convection  problems  as  well  as  for  the 
compressible  Euler  equations. 

3  A  CLASSIC  TVD  SCHEME 

3.1  Introduction 

Before  introducing  polynomial  reconstruction  in  section  4 
and  the  schemes  that  can  be  built  with  it  (section  5),  a 
classic  TVD-scheme  for  adaptive  unstructured  grids  is 
discussed  in  this  section.  This  discussion  shows  approx¬ 
imately  the  current  status  in  finite  volume  solvers,  their 
capabilities  and  their  shortcomings.  These  shortcomings 
arc  to  be  situated  in  the  fact  that  finite  volume  solvers  arc 
sensitive  to  grid  irregularities  and  that  their  spatial  ac¬ 
curacy  is  reduced  to  first  order  because  of  the  use  of  lim¬ 
iters  or  artificial  dissipation.  ENO-schemcs  as  described 
in  sections  5  and  6  try  to  cure  those  shortcomings. 

Concerning  the  notation,  q  denotes  always  the  exact  so¬ 
lution  of  a  given  analytical  problem  whereas  Q  always 
stands  for  some  discrete  solution  of  a  discretized  prob¬ 
lem. 

3.2  Discretization 

The  eqs.  (3)  and  (4)  can  be  discretized  while  defining 
a  set  of  non-overlapping  polygonal  finite  volumes  with 
area  Up  which  cover  the  computational  domain  ft.  The 
discrete  unknowns  Qp  are  associated  with  the  gravity 
center  of  the  cell.  This  leads  to  the  next  discrete  finite 
volume  equations  that  have  to  be  satisf.sd  for  each  cell 

9.R..-..9.L  -f  ~  £  //pn.Aspft  =  0  (6) 

where  the  summation  extends  over  all  the  neighbouring 
cells  R  adjacent  to  cell  P,  fig.  1 . 


Fig.  1:  Flux  at  the  interface  between  cell  P  and R. 

Here,  Aspr  is  the  length  of  the  edge  common  to  cells 
P  and  R.  The  superscript  n  stands  for  the  current  time 
level  where  the  the  time  t  is  equal  to  n.  At.  As  the  interest 
here  goes  mainly  to  the  space  accuracy  of  the  scheme,  the 
reader  is  referred  to  ref.  [3]  for  more  a  complete  descrip¬ 
tion  of  the  construction  of  explicit  and  implicit  schemes 
based  on  this  type  of  space  discretization.  Hpp  is  the 
numerical  flux  through  the  edge,  defined  by  an  approxi¬ 
mate  Riem&nn  solver,  ref.  [17].  In  the  present  approach, 
Van  Leer’s  flux  vector  splitting  and  Roe’s  flux  difference 
splitting  are  adopted  as  approximate  Rieraann  solvers  in 
order  to  obtain  an  upwind  treatment  of  the  flux.  We  have 
for  Van  Leer  [18]: 


//mi  =  //+wfc) +/'”«;)  (?) 

and  for  Roc's  splitter  [19]: 

//<*«  =  |  [//(Qji)  +  //(4?«)1 

-i|c...(Qm)l(Q«-Qii)  (6) 

in  which  Cm  is  the  Jacobian  taken  at  the  Roc- 
averaged  state  Qm  (Q  j„  Qn).  In  c<|9.  (7)  and  (8),  Qf, 
and  Qjl  indicate  any  type  of  extrapolation  of  the  un¬ 
knowns  in  cell  P  reap.  It  from  the  gravity  centers  of 
these  cells  to  the  interface  between  these  two  cells.  How¬ 
ever,  in  classic  finite  volume  schemes,  only  constant  or 
linear  extrapolation  is  employed: 


Qt>  =  Qp 

Qn  -  Qn  19) 

in  the  case  of  constant  solution  representation  and 

Qp  ~  Qp  +  (re-  -  rp).$Qp 

Qn  ~  Qn  +  (fo  -  fn).^QR  (10) 


for  linear  solution  reconstruction,  whereby  'VQp  and 
fV>,j  arc  assumed  to  be  constant  gradient  vectors  for 
cell  P,  tesp.  cell  II.  The  vector  ?o  points  to  the  mid¬ 
point  O  of  the  edge  between  the  cells  P  and  It. 

On  an  unstructured  mesh,  the  classic  finite  difference  for¬ 
mulas  for  approximating  the  first  derivatives  of  the  dis¬ 
crete  solution  Q  in  eq.  (10)  cannot  be  used.  An  approxi¬ 
mation  for  the  gradient  of  Q  over  a  given  polygon  P  can 
be  found  using  the  Gauss  theorem,  fig.  2: 


(U) 


Indeed,  taking  P  =  (Q,  0),  and  assuming  ^  constant 
over  the  polygon  P,  eq.  (11)  yields: 


and  the  contour  integrals  can  easily  be  discretized  as  a 
sum  over  the  edges  of  the  control  volume: 


Q  Wjt  ds  =  £  Q'  (»•*),  As i 


(13) 


where  Q *  is  taken  as  the  average  of  Q  between  the  two 
nodes  defining  edge  »: 


o*  —  £i  A 

Wi  “  2 


(H) 


On  a  mesh  with  cell- centered  storage  of  the  unknowns, 
the  integration  contour  for  estimating  the  gradient  in  a 
given  cell  P  is  defined  in  lig.  3, 


Fig.  3:  Integration  contour. 

Note  that  the  same  method  is  used  in  a  cell  adjacent  to 
a  boundary  of  the  domain,  except  that  the  integration 
contour  is  no  longer  ccntr-l  with  respect  the  cell  t ,  fig. 
4.  Furthermore,  additional  unknowns  are  stored  halfway 
the  edges  representing  the  boundary.  In  the  next  sub¬ 
section,  it  will  be  shown  that  these  additional  unknowns 
ate  also  useful  for  treating  the  boundary  conditions  more 
consistently. 


Fig.  4:  Contour  close  to  a  wall. 

Using  the  gradient  computed  in  the  above  manner,  it  is 
possible  to  establish  a  piecewise  linear  reconstruction  of 
the  discrete  solution  over  a  cell;  i.c.,  for  any  point  fin  cell 
P,  we  obtain  a  linear  lepresentation  Qp'(r)  of  the  state 
variables  given  by  eq.  (10).  If  P  is  taken  at  the  centroid 
of  the  cell,  it  is  easy  to  verify  that  this  linear  variation  is 
just  a  redistribution  of  the  cell  averaged  data: 


(the  subscript  m  denoting  some  point  inside  the  polygon 
P)',  in  the  same  way,  one  finds: 


1 


ftp 


Qnydn 


(12) 


<Q^\r)>p~Q(rP)  =  QP  (15) 

i.e.  cell  averages  and  values  at  the  gravity  center  are  ex¬ 
changeable.  Hence,  it  was  indeed  justified  to  use  values 
at  the  gravity  center  in  eq.  (6)  when  discretizing  eq.  (3). 


It  is  also  im|>ortant  to  point  out  hero  that  the  linear  re¬ 
construction  as  presented  here  will  exactly  reconstruct 
the  exact  solution  q  of  c<|.  (3)  if  j  is  linear.  'I'o  obtain 
this,  one  needs  only  the  values  qr  —  q(ro,,)  of  the  so¬ 
lution  at  the  centroid  Gp  of  each  cell  P.  Schemes  with 
this  property  arc  called  linearity  preserving  or  k- exact, 
sec  [8], 

Finally,  remark  also  that  the  scheme  given  by  eq.  v*>) 
whereby  the  gradients  in  eq.  (10)  arc  estimated  as  de¬ 
scribed  above,  it  equivalent  to  the  Fromm  scheme  (20) 
when  applied  in  one  dimension.  The  Fromm  scheme  is 
a  member  of  the  family  of  the  so  called  x-schcmes  (21) 
whereby  x  =  0.  The  Fromm  scheme  is  upwind  biased  in 
the  sense  that  the  fluxes  in  oq.  (7)  and  (8)  arc  taken  on 
one  side  while  the  stencil  to  calculate  the  extrapolated 
solution  variables  is  central. 


3.3  TVD-Limiters 

Schemes  using  a  piecewise  linear  reconstruction  are  said 
to  be  second  order  accurate  in  space  on  regular  grids  bat 
are  unstable  or  at  least  they  are  highly  oscillatory  near 
discontinuities.  In  order  to  achieve  TVD  properties  for 
those  schemes,  the  gradient  slopes  used  for  Q'f,  and  Qr 
in  eq.  (10)  are  limited  as  follows: 


=  Qp  +  Vpr -'QQi-^o  -  ?r) 

Qr  —  Qr  +  Vrp  •  $Qr-  (?o  -  ?r)  (16) 

where  y>  is  the  limiting  function  whose  value  is  in  most 
of  the  eases  to  be  found  in  the  interval  (0, 1],  As  an 
example,  van  Albada  [6]  proposed  the  following  form  for 
the  limiting  function: 


where 

r  =  si9n(a.l >).y/Lli«£  (18) 

in  which 

a  =  (Qr  -  Qr)-lfJ* JVkLchI  (19) 

for  the  computation  of  Qj-  and 

«  =*  (Qr  -  Qr)  .  /->0\ 

b  =  ?QR.(F-rn) 

for  the  evaluation  of  Qr.  The  symbol  t  denotes  a  small 
positive  number  that  avoids  division  by  zero.  Other  lim¬ 
iters  atv  given  in  [6,  12]. 

Note  that  the  limiting  can  be  performed  one  any  type  of 
variables:  conservative  Q,  primitive  Q  or  characteristic 
IV  variables.  For  this  purpose,  the  symbol  Q  should  be 
replaced  in  the  above  equations  by  the  proper  one:  Q,  Q 
or  W. 

The  use  of  equations  (16)-(20)  guarantees  that  Qf,  and 
Qh  will  have  a  value  in  the  interval  (Q/>,Vo]  resp. 
[<2o,<?«].  where  Qo  is  defined  as: 


Qo 


Qr  +  (Qr  -  Qr)- 
Qr  +  (Qp  -Qr)- 


I?  ~  ?>1 

|r-  fp|  +  |f-  r«| 

|r  -  Fn| 

|r-  f>|  +  |r  -  fj*| 


(21) 


Fig.  5:  ID  Representation  of  the  limiting  procedure. 


In  11),  the  value  of  Qo  corresponds  to  a  linear  interpo¬ 
lation  at  the  cell  interface  between  the  values  of  Qp  ami 

Qr,  fig-  5- 

By  restricting  the  values  of  Qf,  and  Qr,  i.c.  for  values  of 
the  function  y>  close  to  zero,  oscillations  arc  suppressed  in 
the  neighbourhood  of  zones  where  large  gradients  occur 
because  the  scheme  turns  locally  back  to  first  order. 

3.4  Boundary  Equations 

3.4.1  Flux  Equality  Method 

The  flux  Kpn  in  eq,  (6)  through  edges  at  the  boundary 
of  the  domain  arc  computed  by  introducing  additional 
unknowns  along  each  boundary  edge  (tig.  6). 


Fig.  6:  Boundary  edge. 

According  to  [22],  the  boundary  unknowns  Qr  are  de¬ 
termined  by  solving  the  following  algebraic  consistency 
equation  for  the  boundary  flux  Ii r « ,  c.g.  for  van  Leer 
splitting  (see  eq.  (7)): 

H  PR  =  U*(Qp)+  ir(Qn)  =  H'(Qp)  (22) 

where  Qr  is  the  adjacent  cell  variable  and  //*(Q)  a 
boundary  flux  function  which  satisfies  the  boundary  con¬ 
ditions.  In  other  words,  the  numerical  flux  JIpr  must 
equal  the  physical  flux  //*  at  the  boundaries.  E  g.  for 
a  solid  wall,  enforcing  zero  normal  velocity,  the  fluxes 
become: 


n*(Qp)  = 


0 

p(Qp)-»j 

l»(Qr)«v 

0 


It  has  been  shown  by  Dcconinck  et  al.  [22]  that  this 
boundary  condition  treatment  is  a  characteristic  bound¬ 
ary  condition  treatment  in  the  case  of  a  linear  hyperbolic 
system.  For  the  nonlinear  system  it  is  an  approximate 


characteristic  boundary  condition  consistent  with  the  in¬ 
terior  approximate  Hicm.xnn  solver. 

3.4.2  Flux  BaIauco  Method 

As  proposed  by  Degrez  (23],  one  updates  the  variables 
in  » •  •  interior  cells  adjacent  to  the  walls  separately  fro  .1 
tho-se  being  completely  inside  the  mesh. 

First,  all  fluxes  through  interior  edges  are  computed  as 
usually.  In  order  to  update  the  variables  in  an  interior  ceil 
P  adjacent  to  the  wall,  one  uses  the  flux  balance  equa¬ 
tion  (6).  However,  the  fluxes  entering  the  cell  through 
its  boundaries  that  coincide  with  a  physical  boundary 
arc  yet  unknown.  Isolating  the  known  fluxes  in  the  (lux 
balance  equation  (3)  gives: 


the  boundaries.  The  boundary  conditions  would  not 
be  matched  exactly. 

3.  numerical  experiments  revealed  that  some  underre- 
laxatiou  of  the  extrapolations  was  requested  to  en¬ 
sure  convergence  when  second  order  space  accuracy 
was  used  inside  the  mesh,  equation  (?<>)  becomes 
then: 


[«»“>* 1  -!•  u>  .  (rtv  -  rr)j  nr.w  —  0  (2V) 

with  u>  the  relaxation  parameter  having  a  value  in 
the  interval  [0, 1]. 


<  </,  >r  +  -j-  /  IUV  +  [  II JV  a  0 

Jon,.\on,.  w  Jon,. 

(23) 

with  II  —  !'  tt  and  <)d  vv  denoting  the  edge  on  the  do¬ 
main  boundary.  After  discretizing,  one  finds: 


Qrn 


II i’R  &3rn  +  Wp.tv.Aar.tv I 

\n*w  j 

(24) 


Calling  the  lirst  (known)  summation  in  the  right  hand 
side  and  writing  the  equations  in  full  whereby  AQi>  = 
•-  Q'r  i  the  next  set  of  equations  is  found  when  deal¬ 
ing  with  a  solid  wall: 


A<?,.  =  7,  - 


At 

ih‘ 


o 

)Hv .»«* 

J>l*’.«y 


.A.sj'.tv 


(25) 


in  which  IV  denotes  the  flat  boundary  ceii  bordering  the 
current  cell  P.  One  has  now  a  set  of  4  equations  with  5 
unknowns:  4  unknowns  in  the  vector  A Qp  and  the  static 
pleasure  in  the  boundary  cell  IV,  pw . 

The  first  equation  (mass  conservation)  can  readily  he 
solved,  yielding  Ap>>  and  thus  p’/>+ 1 ,  and  so  can  the  fourth 
equation.  The  velocity  vector  fi£+l  and  the  wall  pres¬ 
sure  pvv  can  be  evaluated  only  if  an  additional  equation 
can  he  found.  The  flux  balance  method  consists  now  in 
stating  that  the  value  in  P  (at  time  step  n  +  1),  when 
extrapolated  to  the  boundary  cell  W  using  the  relevant 
gradient  in  the  cell  (as  of  time  step  «),  should  match  the 
boundary  condition.  Practically,  in  ease  of  the  flow  tan* 
gency  condition  at  a  solid  wall,  the  additional  equation 
reads 


(u,\+“  +  (tfuV,)  .  (fW  -  f,.)]  .ny.w  -  0  (26) 

T  his  equation  allows  now  to  evaluate  all  variables  in  the 
interior  cell  P  at  time  level  n  +  1.  However,  a  few  points 
should  be  emphasized: 

1 .  when  several  edges  of  the  interior  cell  P  arc  bound¬ 
ary  edges,  extrapolating  the  variables  to  each  of 
those  boundaries  is  impossible  (it  would  yield  dif¬ 
ferent  values  to  be  stored  in  the  cell  /’).  A  strategy 
should  be  devised  to  handle  such  cases,  e.g.  aver¬ 
aging  the  extrapolated  variables  over  all  boundary 

edgitS. 

2.  when  second  order  space  accuracy  is  used  inside 
the  mesh,  limiters  are  used  to  preserve  the  mono- 
tonicity  of  the  solution,  but  obviously  those  limiters 
should  not  be  used  when  extrapolating  variables  to 


3.5  Grid  Adaptivity 

In  order  to  have  sufficient  resolution  of  the  calculated 
solution  in  regions  of  high  truncation  error,  locally  more 
and  liner  cells  arc  needed.  In  the  present  work  this  was 
done  by  a  simple  grid  enrichment  technique,  based  on 
gradients  in  the  solution,  ref.  (1.  2].  A  given  cell  is  refined 
if  the  pressure,  or  alternatively  the  streamwise  entropy 
gradient  both  weighted  by  the  cell  area  satisfy 


I  l<‘  ftr  -  r  I  |i»  .  r 

RMS  ftm4;  ’  RMS  'fW 


(28) 


where  the  gradients  arc  sealed  to  the  RMS  value  over  the 
field,  and  C  is  a  user-specified  threshold.  The  cell  area 
Op  is  normalized  with  respect  to  the  maximum  cell  area 
Umax  over  the  field.  To  facilitate  the  choice  of  the  thresh¬ 
old  C,  a  histogram  of  the  gradient  distribution  over  the 
mesh  cells  is  calculated  as  proposed  in  [24],  lienee,  in¬ 
stead  of  the  threshold,  the  fraction  of  the  total  number 
of  cells  to  be  refined  is  given  as  an  input  for  the  refining. 
Furthermore,  a  preliminary  view  of  the  refined  grid  can 
he  obtained  by  plotting  the  grid  (while  connecting  the 
gravity  center  of  all  cells  satisfying  condition  (28)  to  the 
middle  of  their  edges  according  to  the  refinement  strat¬ 
egy  described  below).  This  view  can  help  to  tunc  the 
threshold  value  C  in  a  finer  way  without  having  to  spend 
large  amounts  of  computational  time  in  order  to  set  up  a 
new  dalastructiiic  associated  with  the  refined  grid. 


A  cell  is  refined  by  connecting  its  gravity  center  with  the 
middle  of  each  edge  (fig.  7a  and  7b).  If  the  interior  cell 
to  be  refined  lies  next  to  the  computational  boundary, 
♦he  boundary  cell  is  split  as  weii,  whereby  the  newly  cre¬ 
ated  boundary  cells  have  the  same  boundary  condition 
as  the  original  cell.  If  some  of  the  neighbouring  cells  are 
not  refined,  new  vertices  have  to  be  created  along  the  re¬ 
fined  edges,  which  increases  the  number  of  vertices  of  this 
neighbour  cell.  The  conservative  variables  in  the  newly 
created  cells  ate  copied  from  the  original  cell  P. 


Fig.  7a:  Refinement  in  the  interior  domain. 

The  present  refinement  strategy  may  lead  to  non-smooth 
grids,  but  has  the  important  merit  to  limit  the  remesh- 
ing  zone  to  exactly  the  region  targeted  by  the  refinement 
criterion. 
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Fin.  'b:  Refinement  next  to  a  domain  boundary. 


Anyhow,  the  non-smoothness  of  refined  grids  affects  the 
solution  quality  dramatically.  Especially  when  several  re- 
linements  are  performed,  unacceptable  grid  irregularities, 
called  "spiders",  appear.  "Spiders''  occur  in  a  cell  which 
has  not  yet  been  refined  in  the  previous  refinements  while 
some  of  its  neighbours  did.  Connecting  now  its  gravity 
center  with  the  middle  of  each  (!)  of  its  edges  when  re¬ 
fining  the  cell,  leads  to  the  creation  of  a  "spider"  (fig.  8) 
consisting  of  cells  with  an  odd  shape. 


Fig,  8:  Generation  of  a  "spider". 

This  is  cured  bv  introducing  feeef  dependent  refinement. 
This  involves  assigning  a  level  to  each  grid  cell  and  each 
grid  vertex.  A  cell  is  then  refined  as  if  all  of  its  neigh¬ 
bouring  cells  have  the  same  level,  and  no  "spiders"  occur 

(lig-  !»)• 


Fig.  9:  Level  dependent  refinement. 

The  level  of  all  cells  and  vertices  of  the  starting  grid  is 
equal  to  1.  Fig  10  shows  how  the  levels  are  assigned  after 
refinement  of  a  cell.  Storage  of  the  grid  cell  levels  is  not 
to  be  seen  as  a  drawback  since  these  levels  will  be  needed 
for  a  future  uudtigrid  implementation. 

Furthermore,  before  the  refinement  starts,  some  flagged 
cells  for  refinement  loose  their  Hag  again  in  order  to  avoid 
isolated  refined  cells  and  spits  of  refined  roues  into  non- 
refiued  zones.  The  algorithm  devised  for  this  turns  off 
(he  flag  of  refinement  of  a  cell  if  at  least  all  but  one  of  its 
neighbours  will  not  be  refined,  fig.  11  Obviously,  several 
scans  will  be  needed  before  lire  above  rule  will  be  fully 
satisfied. 


1,2:  Cell  Levels 

1,2:  Vertex  Levels 


Fig.  10:  Assigning  levels  to  the  new  cells  and  vertices. 


On  the  other  hand,  in  order  to  avoid  islands  and  creeks 
of  non- refined  cells  and  a  too  large  deference  in  cell  size 
between  a  cell  and  its  neighbours,  the  following  rules  are 
applied  after  the  refinement  is  finished: 

1.  A  cell  is  to  be  relined  additionally  when  ail  of  its 
neighbours  have  a  higher  level  (fig.  12a). 

2.  A  cell  is  to  be  refined  additionally  when  all  except 
one  of  its  neighbours  have  a  higher  level  (fig.  12b). 

.1.  A  cell  is  to  be  refined  additionally  when  the  level 
difference  between  the  cell  itself  and  one  of  its  neigh¬ 
bours  exceeds  one  (lig.  12c).  Fig.  12d  shows  how  it 
is  refined. 


Fig.  12a:  All  neighbours  were  refined. 


Fig.  12b:  All  neighbours  except  one  are  at  a  higher 
level. 

Again,  several  scans  of  a  refined  grid  are  needed  to  sat¬ 
isfy  all  the  rules,  specially  when  there  arc  deep  creeks. 
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Fig.  11:  Rules  for  avoiding  islands  and  pits  uf  refined  cells. 


The  computation  was  started  on  an  initial  mesh  contain¬ 
ing  154  cells  as  shown  in  fig.  15.  Not:  that  the  mesh 
was  copied  twice  in  this  figure  in  order  to  achieve  a  bet¬ 
ter  visual  impression  of  the  cascade.  Refining  the  grid  5 
times  on  basis  of  the  velocity  projected  entropy  gradients 
in  each  intermediate  solution,  the  final  mesh  contained 
9333  cells,  fig.  16. 

As  can  be  seen  in  fig.  17a,  the  isentropic  Mach  number 
distribution  obtained  from  the  filial  grid  hardly  exhibits 
any  oscillation.  However,  note  that  the  dip  in  the  distri¬ 
bution  in  the  lower  right  corner  of  the  figure  is  due  to  the 
fact  that  the  impinging  shock  in  point  A  of  the  wedge 
is  not  perfectly  cancelled  by  the  expansion  wave  in  the 
same  point.  This  is  presumed  to  be  caused  by  round-off 
errors  in  the  specification  of  the  geometry.  Comparison 
of  fig.  17a  with  the  result  by  Holmes  [26]  as  given  in 
fig.  17b  reveals  clearly  the  monotonic  character  of  the 
present  scheme.  Holmes’  result  however  does  not  show 
the  dip  in  the  lower  right  corner  of  the  figure.  Both  meth¬ 
ods  have  difficulties  in  predicting  the  exact  value  of  the 
isentropic  Mach  number  after  the  expansion  wave  in  point 
B  of  the  wedge. 

Comparison  of  the  isentropic  Mach  lines  obtained  with 
the  two  methods  shows  again  that  the  cancelling  of  the 
impinging  shock  and  the  expansion  shock  in  point  A  is 
not  complete  with  the  present  method,  fig.  18a  and  18b. 


These  a  priori  and  a  posteriori  scans  of  the  mesh  to  be 
adapted  decrease  the  precise  control  on  the  regions  where 
the  grid  is  to  be  refined,  but  the  smoothness  of  the  grid 
is  enhanced  considerably.  Finally,  one  of  the  main  rea¬ 
sons  to  use  a  nested  refining  as  described  here  instead 
of  a  complete  (solution  adaptive)  remeshing  is  the  future 
implementation  of  a  multigrid  acceleration  technique. 

3.6  Results 

This  subsection  presents  the  results  obtained  from  a  cal¬ 
culation  on  a  2D  supersonic  staggered  wedge  cascade. 
This  test  case,  for  which  an  analytical  solution  exists  was 
proposed  by  Denton  [25];  the  upstream  Mach  number  is 

1.6  and  the  incidence  is  60°.  The  geometry  of  the  prob¬ 
lem  is  given  in  fig.  13  while  the  isentropic  Mach  number 
distribution  on  the  profile  predicted  by  the  analytical  so¬ 
lution  is  shown  in  fig.  14.  Comparisons  will  be  made  with 
the  results  of  the  computation  by  Holmes  [26]. 

The  computatiin  was  done  using  a  Van  Leer  approxi¬ 
mate  Ricmann  solver,  [18].  In  order  to  perform  the  time 
integration  a  point  GaussScidel  relaxation  technique  is 
used,  see  ref.  [3],  Van  Albada’s  flux  limiter  was  applied 
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Fig.  13:  The  Geometry  of  the  Problem. 


Mach  _  Suction  side 

number  _ Pressure  side 
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Fig.  14:  The  Analytical  Isentropic  Mach  Number  Distribution 


Fig.  15:  The  Initial  Mesh  (154  Cells). 


Fig.  16:  The  Final  Mesh  (9333  Cells). 
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4  RECONSTRUCTION 
ALGORITHMS 


4.1  Introduction 


The  estimate  of  the  solution  variables  at  the  cell  inter¬ 
faces  based  on  linear  extrapolation  given  by  cq.  (10)  be¬ 
comes  inaccurate  for  non-linear  functions  and  the  idea 
proposed  by  Barth  [7,  8]  is  .o  perform  a  higher  order  ex¬ 
trapolation  which  is  still  exact  for  polynomial  functions 
g(z,j/)  of  higher  degree  over  the  domain  0.  In  turn, 
this  higher  order  extrapolation  will  allow  for  more  ac¬ 
curate  flux  evaluations  at  the  cell  interface.  A  variant  of 
Barth’s  reconstruction  algorithm  based  on  zero-mean  ba¬ 
sis  polynomials  (Zero-Mean  Higher  Order  Reconstruction 
or  briefly:  ZM-110R)  is  presented  in  this  section  together 
with  a  new  reconstruction  algorithm  using  basis  polyno¬ 
mials  with  a  zero-value  at  some  reference  point  of  the 
cell  where  the  solution  is  to  be  reconstructed.  In  general, 
this  reference  point  will  be  the  gravity  center  of  the  cell 
(GC-HOR).  The  second  algorithm  is  believed  to  be  com¬ 
putationally  less  involving  since  less  inertial  moments  are 
to  be  computed. 


Note  that  solution  reconstruction  is  just  a  way  of  interpo¬ 
lating  a  discrete  set  of  given  point  values  or  mean  values. 
Other  interpolation  techniques  can  be  devised  such  as  in 
Harten  [16]  whereby  the  derivatives  of  a  truncated  Taylor 
series  expansion  are  determined  by  stating  that  the  mean 
of  this  series  in  each  support  cell  must  equal  the  respec¬ 
tive  given  discrete  mean  value.  Harten  also  developed 
the  GC  counterpart  of  this  algorithm. 


cell  P,  This  reference  point  is  in  general  cell  P's  gravity 
center  fap.  Furthermore,  the  function  has  to  satisfy 
the  following  requirements,  fig.  Id: 

•  it  must  conserve  the  mean  in  the  cell  P.  In  other 
words,  the  mean  value  of  Q*p*  over  the  cell  area  Up 
must  be  equal  to  the  discretized  average  value  Qp 
assigned  to  cell  P: 


Qp  =  ffr-Jf  =  <  >P  (29) 

•  it  must  represent  polynomial  functions  of  degree 
r<fc  exactly  over  the  whole  cell  area.  In  this  case, 
the  reconstruction  Q ^  over  the  complete  domain 
will  be  continuous  across  the  cell  edges.  If  r  >  k 
then  discontinuities  across  the  c'1"  edges  arc  allowed. 

Note  that  cell  P  can  be  either  an  inter  .  -ell  or  a  bound¬ 
ary  cell. 


Fig.  19:  Representing  the  solution  in  cell  P  with  a 
polynomial  function. 

In  the  following,  the  origin  of  the  coordinate  system  is 
always  translated  towards  some  reference  point  fp  asso¬ 
ciated  with  cell  P.  The  effect  of  round-off  errors  in  the 
implementation  is  attenuated  due  to  this  translation  of 
the  coordinate  system. 

Conservation  of  the  mean  is  met  if  Q 'p'  belongs  to  a  func¬ 
tion  space  Vk(flp)  with  a  basis  of  zero  mean  polynomials 
defined  by: 


4.2  ZM-HOR-Algorithm 


The  aim  is  to  represent  the  discrete  solution  Q  in  each 
cell  P  by  a  polynomial  function  Qp  *(z,  y)  of  degree  k 
where  fc>l.  The  discrete  solution  value  QP  assigned  to 
cell  P  is  not  related  to  any  reference  point  of  the  cell, 
it  is  supposed  to  be  the  mean  of  the  solution  distribu¬ 
tion  over  the  cell  whereas  the  notation  Qp  denotes  the 
discrete  solution  value  at  some  reference  point  fp  of  the 


T  —  (FV^(r)  :0<i  +  j <k  and  r  6  Up] 
where  i  and  j  arc  always  positive  and 


f  1 

© 

1! 

'*-a 

li 

£ 

< 

l! 

*Vp  -  iff 

for  0  <  t  -f-  j 

<k 

(30) 

Azp  = 

x  —  xp 

(31) 

Aj ip  = 

y-vr 

(32) 

Ipp 

ftp  I  In  p 

Aijo.Aj/j,.  dfl 

(33) 

4.2.1  Darth’s  Version 

Barth  [7,  8]  proposes  the  following  form  for 
struction  polynomial  in  the  cell  P: 

the  recon- 

i+J<k 

Q(r\n  =  E  K, 

•  H=o 

=  V.F 

•  Fij(r) 

(34) 

with 

< 

II 

7 

Cj 

c*. 

Vo.k\T 

(35) 

and 

F=[F0l0(?)  •••  F,,y(r)  ... 

F„,r(?)]'r 

(36) 

The  expression  V.F  is  the  scalar  product  of  the  two  vec¬ 
tors;  i.e.  the  sum  of  the  products  of  their  corresponding 
components.  Note  also  that  the  coordinates  of  any  vector 
f  are  given  in  a  normalized  coordinate  system  related  the 
cell  P. 

Barth  suggests  further  the  following  form  for  the  coeffi¬ 
cients  in  the  vector  V : 

V  =  W.Q  (37) 

where 

ipRi 
”0.0 
H/«l 

w  =  w\* 

.  Ki 

and 

Q-[Q,r.Q«a  ■  (39) 

in  which  W%R  denotes  the  weight  assigned  to  th.  support 
cell  R  when  computing  the  coefficient  V,,,  o'  the  recon¬ 
struction  polynomial  in  cell  P.  The  subscript  u  gives  the 
number  of  cells  present  in  the  support  for  cell  P.  The 
support  is  a_set  of  ceils  in  the  neighbourhood  of  cell  P. 
The  array  Q  contains  the  discrete  solution  averages  in 
each  support  cell. 

The  weight  factors  WRj  can  now  be  determined  by  requir¬ 
ing  that  each  of  the  zero  mean  basis  polynomials  F,,t(f), 
with  0<s  +  t<k,  must  be  represented  exactly  by  the  re¬ 
construction  algorithm  independently  of  the  local  grid 
shape.  In  other  words,  the  right  hand  side  of  eq.  (34) 
must  match  exactly  with  each  basis  polynomial  Fj,t(r) 
over  the  cell  P  if  F,,t(r)  is  specified  as  exact  cell  aver¬ 
ages  <  F,, t(r)  >p  and  <  F,,,(r)  >R  in  the  cells  P  and  R 
respectively: 


Kj  ■ 

•  ' 

W  hz 

*M,0 

*M,0 

<£  ■ 

\V  Hn 

(38) 


F(P)  =  A  .  F(r)  (40) 

where  A  is  now  a  matrix  whose  rows  contain  the  coeffi¬ 
cients  A'j  (0<«  +  j<k  and  0<s  +  t<k)  to  reconstruct  the 
corresponding  basis  polynomial  F,,t(r)  in  the  left  hand 


side  of  eq.  (40): 

A  = 

A0.0 

*0,0 

Al>0 

*0,0 

qooo 
©  ~  ~  •  * 

. .  0 

..  yi1,0 

*0,k 

4°.* 

*0,0 

A**  . 

*\,0 

A0,k  . 

It  is  easily  seen  that  cq.  (40)  is  satisfied  if 


A  =  A  (42) 

where  A  is  the  (m  x  m)  unity  matrix  with  m  the  number 
of  coefficients  in  a  two  dimensional  polynomial  of  degree 


m  =  (43) 

Knowing  that  one  is  reconstructing  a  known  function  vec¬ 
tor  F(r)  and  using  cq.  (37),  this  results  in  the  following 
set  of  systems  of  equations: 


W.F=A  (44) 

or 

[Ff.[W]T  =  A  (45) 

whereby  the  matrix  F  contains  the  mean  <  F;,y(r)  >R 
of  the  known  basis  polynomials  over  each  support  cell  R 
with  0<»  +  j<kx 


<  Fo,o  >H‘  <  Fi.o  >Rl  <  F0,k  >"*  ‘ 

_  <  Fo,o  >Ra  <  Fi,0  >R 7  •  •  •  <  F0,k  >Rl 

F  = 

.  <  Fo,o  >R *  <  Fi.o  >R"  ■■■  <F0k  >R * 

(46) 

By  construction  the  elements  of  F  only  depend  on  geo¬ 
metrical  characteristics  of  the  support  used  for  the  recon¬ 
struction  in  cell  P.  Therefore,  also  the  weights  obtained 
as  a  solution  of  cq.  (44)  arc  only  geometry  dependent. 
Eq.  (45)  represents  a  set  of  m  systems  of  m  equations 
with  n  unknowns  where  m  is  given  by  eq.  (43)  and  n  is 
the  number  of  cells  R  taken  into  the  support.  The  first 
system  of  eq.  (45)  reads  while  taking  into  account  that 
<  Fo.o  >rt  5  1,  V«: 


and 


R 


=0  ,0<i  +  j<k 

H 

This  reduces  the  system  (44)  to: 

W.I  =  A 

in  which  the  matrix  I  is  given  by 


jRi,p 

i0,0 

/ria.P 

‘0,0 

IR*  F  .. 

Jl,0 

1*3, p 

M,0 

/« s.f*  • 

'o.* 

1A3.P 

‘(IA 

1! 

J  Rn,P 
*0,0 

7«n.P 

1 1,0 

,H..P 

'o.k 

(47) 


(48) 


(49) 


(50) 


where  I  R;r  k  i, he  inertial  moment  of  order  (i, ))  of  the 
support  cells  R  with  tcspcct  to  the  reference  point  r*p  of 
cell  P: 


'S'  ’  "  (51> 

Remark  that  lRj/>  =  1,  Vif.  To  be  able  to  solve  each  sys¬ 
tem  of  equations  exactly  in  (49),  the  number  of  support 
cells  >r  should  be  taken  at  least  equal  to  m.  The  selection 
of  the  cells  R  needed  for  the  algorithm  will  be  discussed 
in  a  later  section. 
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As  n  is  at  least  equal  to  m,  each  system  of  eq.  (49)  is  an 
undcrdetermined  linear  system  of  equations.  They  can 
be  Bolvcd  using  a  modified  Gram-Schmidt  algorithm  with 
column  pivoting,  see  ref.  [27],  The  arbitrary  constants  in 
the  general  solution  of  eq.  (49)  can  be  determined  by  im¬ 
posing  additional  constraints  on  the  weights  W*r  As  we 
want  now  the  reconstruction  to  be  as  insensitive  as  pos¬ 
sible  to  noise  in  the  given  data,  the  norm  of  the  weights 
mast  be  minimal: 

5Z  (w&)  must  be  minimal  (52) 

R 

for  0<$  -f  j< k.  In  ref.  [27],  it  is  shown  that  the  least 
squares  problem  (52)  is  equivalent  to  solving  an  overde- 
termined  system  of  n  equations  with  n  -  m  unknowns 
(the  arbitrary  constants  in  the  general  solution).  A  least 
squares  solution  of  the  system  can  be  obtained  using 
again  the  modified  Gram-Schmidt  algorithm  mentioned 
above  and  is  then  inserted  in  the  general  solution  oi 
cq.  (64). 

4.2.2  Present  Version 

The  version  of  the  ZM-HOR-Algorithm  presented  here 
differs  from  Barth’s  version  in  two  manners.  First,  no 
local  coordinate  transformation  is  performed  towards  a 
normalized  system.  Secondly,  in  this  approach,  the  re¬ 
construction  is  seen  as  separated  into  a  constant  part 
equal  to  the  given  cell  average  and  a  part  representing 
the  higher  order  terms: 

Q{P(r)=QP  +  [V.F  -  V'o.o.Fo.o^)]  (53) 

where  V  is  again  the  vector  as  in  cq.  (34)  and  (35).  In 
other  words,  the  term  Vo,o.Fo,o(r)  *n  co-  (34)  is  replaced 
by  the  given  cell  average.  Equation  (53)  can  therefore  be 
rewritten  as: 


qP(?)  =  qp  +  v.  f 


(54) 


with 


V  = 

'  ViJ  •’ 

•  W 

(55) 

and 

F  =[Fi,o 

(?)  ...  F,j(?) 

Fq,*(?)]7 

(56) 

Note  that  the  elements  with  subscript  (0,0)  arc  no  longer 
present  in  eq.  (55)  and  (56). 

As  in  Barth *8  version,  the  coefficients  in  V  in  eq.  (54)  are 
viewed  as  a  weighted  sum  of  the  given  discrete  averages 
in  the  support  cells: 

V  =  W.$ 

(57) 

where  now 

W  = 

'  <o‘  <*<?  • 

^2,0  "2,0 

WR* 

"1,0 

"2,0 

(58) 

.  wRi  ^*2 

•  VVR* 
"0  ,* 

Again,  the  weight  factors  W’,“  with  0  <  i  +  j  <  k  can 
now  be  determined  by  requinng  that  each  of  the  zero 
mean  basis  polynomials  F,,t(f),  with  0  <  a  +  f  <  k,  must 
be  represented  exactly  by  the  reconstruction  algorithm 
independently  of  the  local  grid  shape. 


F(P)  =  <  F(F)  >p  +  A.F(F) 


(59) 


where,  similar  to  Barth's  version,  A  is  now  a  [m  x  (m  — 
1)]  matrix  defined  by: 


II 

*•< 

*0,0 

^m.o 

A  1,0 
^1,0 

>1°*°  . 
/i2,0 

A 1,0  . 

.  4°'° 

/10,k 

A1'0 
'  A0,k 

A0,h 
71 1,0 

A^k  . 

**2,0 

.  A*'h 
A0,k 

Eq.  (59)  is  satisfied  over  the  whole  cell  P  if 


(60) 


(61) 


where  A  is  now  the  [(m  —  1)  x  (m  —  1)]  unity  matrix. 
Knowing  that  one  is  reconstructing  a  known  function  vec¬ 
tor  F(r)  and  using  eq.  (57),  this  results  in  the  following 
set  of  systems  of  equations: 


W.Fs=  B  (62) 

whereby  the  matrix  F  remains  the  same  as  in  eq.  (46). 
Note  that  the  components  of  r  are  still  related  to  some 
reference  point  rp  associated  with  the  cell  P. 

Since  <  Fo,o(r)  >R  H  1,  the  first  system  of  eq.  (62)  leads 
now  to: 


=0  .  0<»  +  i<k 

(63) 

R 

This  reduces  the  system  (62)  to: 

W.I  —  B 

(64) 

or 

iTwv  =  nT 

(65) 

where  the  matrix  /  is  defined  in  cq.  (50).  Equation  (65) 
represents  a  set  of  m  -  1  systems  of  m  equations  with  n 
unknowns  where  n  is  the  number  of  support  cells  and  m 
is  given  by  eq.  (43).  Solving  this  set  of  systems  is  done 
in  exactly  the  same  way  as  in  Barth’s  version  except  that 
the  number  of  systems  to  solve  is  reduced  by  one. 

4.2.3  Conservation  of  the  Menu 

We  will  now  investigate  to  what  extent  both  versions  (34) 
and  (54)  of  the  ZM-HOR-Algorithm  conserve  the  mean 
of  a  solution  function  Q(r)  specified  as  a  discrete  s;t  of 
cell  averages  QP  whereby: 


QP  =  <  Q  >p  (66) 

As  zero  mean  polynomials  are  used,  it  follows  immedi¬ 
ately  that  the  mean  of  eq.  (54)  is  equal  to  the  given  cell 
average.  This  means  that  the  present  version  always  con¬ 
serves  the  mean  even  if  the  degree  of  the  function  to  re¬ 
construct  is  beyond  k. 

Taking  the  mean  of  Bar  th’s  formulation  of  the  reconstruc¬ 
tion  polynomial  in  eq.  (34),  one  finds: 

<  Q(p\n  >P  =  Vb,o  =  £  W0RoQr  (67) 
R 

Let  us  now  expand  Q  in  a  Taylor  scries  and  average  it 
over  each  support  cell  R,  Therefore,  two  vectors  are  now 
introduced: 


wfi.P 

*/ 


jB,P 

t-m,n 


(68) 


containing  the  inertial  moments  of  cell  R  of  exactly  or¬ 
der  l  with  respect  to  the  reference  point  rp  of  cell  P  and 


OQf  =  (69) 

(|)  -t>Qo!i 

with 

««C-m.-5;i3S?:|i,  •  •£“£'  <™> 

the  derivatives  of  order  /  of  the  function  Q  at  the  reference 
point  rp  of  cc\\_P.  Onccan  then  average  the  Taylor  scricB 
expansion  of  Qp  and  QR  around  the  reference  point  rp 
giving  (note  that  the  operators  <  ■  >F  and  <  ■  >R  are 
linear): 

Qr  =  <Q>P 

|t=I  *  m=0 

=  Q'’  +  EjtIf‘P  (71) 

1=1  ** 

and 

s  <<?>h 

=  Op+Ej  £  wl . . 

(el  '  m=0 

=  Qr  +  ^  j7 iP'^.dQt  (72) 

i=i  *' 

where  Qp  =  Q(fr)  and  Qn  =  Q(fp).  Inserting  the  above 
equation  in  eq.  (67)  while  using  also  eq.  (48)  leads  to: 

< Op’tO >F  -  Qp  +  tr, ’£«’’> (if a**f ) 

I  ix  1  •*  *2 

+  0(hk")V%' 

=  e.+Ei(E"'A>r)  »of 

+  0(Ak+,)»g+l  (73) 

The  expression  Z>^+l  stands  for  space  derivatives  of  or¬ 
der  k  +  1  of  Q  and  h  is  a  typical  length  associated  with 
cell  P;  e.g.  h  —  y/(ip.  Knowing  that  I  is  never  zero  in  the 
summation  of  eq.  (73)  and  applying  eqs.  (48),  one  finds 
that  the  summation  over  I  is  equal  to  zero  and  one  gets 
using  eq.  (71): 

<qP(t)>p  =  QP  +  0(hfc+,)p£M 
=  QP  +  0(h)Vlq 


Hence,  Barth's  version  of  the  ZM- II  OR- Algorithm  only 
conserves  the  mean  to  order  h,  even  if  the  given  func¬ 
tion  Q  were  a  polynomial  of  degree  r<k.  In  that  case, 
the  mean  of  the  construction  would  exactly  equal  the 
value  Qp  of  the  function  Q  in  the  reference  point  of  cell  /’ 
As  the  reference  point  is  in  most  cases  equal  to  the  cen¬ 
troid  of  the  cell  and  as  Q  is  not  always  linear,  Qp  is  in 
general  not  equal  to  Qp.  However,  conservation  of  the 
mean  will  be  established  in  the  limit  of  h-»0. 

4.3  The  GC-HOR* Algorithm 

The  discrete  solution  Qp  of  cell  P  is  now  supposed  to 
be  given  at  some  reference  point  fp  associated  with  the 
cell  P: 

Qp  =  Q(f'p)  (74) 

In  practice,  the  reference  point  will  coincide  with  the 
gravity  center  of  the  cell  P. 

The  concept  of  a  generic  reference  point  for  the  recon¬ 
struction  algorithm  1b  also  very  useful  for  studying  cell- 
vertex  schemes  for  which  the  discretized  solution  is  in 
general  not  stored  at  the  gravity  center  of  the  associated 
control  volume.  However,  in  the  frame  of  this  contribu¬ 
tion,  the  ideas  are  only  fixed  on  cell-centered  storage. 

The  aim  is  again  to  represent  the  solution  in  each  cell  P 
by  a  polynomial  function  Qp*(f)  of  degree  k  where  k>l. 
The  cell  P  can  either  be  an  interior  cell  or  a  boundary 
cell.  The  function  Q^  has  to  satisfy  the  following  re¬ 
quirements,  fig.  19: 

•  its  value  at  the  reference  point  rp(xp,yp)  of  cell  P 
must  be  equal  to  the  discrete  solution  value  assigned 
to  the  cell  P: 

Q{p\tp)  =  Qr  (75) 

•  it  must  represent  polynomial  functions  of  degree 
r<k  exactly  over  the  whole  cell  area.  In  this  case, 
the  reconstruction  Q<lr)  over  the  complete  domain 
will  be  continuous  across  the  cell  edges.  If  r  >  Jfc 
then  discontinuities  across  the  cell  edges  are  allowed. 

Again,  the  origin  of  the  coordinate  system  is  always  trans¬ 
lated  towards  the  reference  point  fp  of  the  cell  P  in  order 
to  reduce  the  influence  of  round-off  error  in  the  algorithm. 
Requirement  (75)  is  met  if  Q p*  belongs  to  a  function 
space  14  (Qp)  where  its  basis  is  given  by 

<7  —  {Gi,> (f)  :  0  <  i  +  j  <  k  and  f  €  Qp), 
where  now  the  basis  functions  Gi,,(f)  are  given  by 

Gi,}  =  Ar'p.  AyJ>  :  0  <  i  +  j  <  k  (76) 

As  for  the  ZM-IiOR-Algorithni,  the  reconstruction  poly¬ 
nomial  is  given  by  a  linear  combination  of  the  basis  poly¬ 
nomials: 

QP(f)  =  V.G  (77) 

where  the  vector  V  is  given  by  eq.  (55)  and  the  vector  G 
is  similar  to  the  vector  F  defined  in  eq.  (56)  but  uses  the 
non-xero  mean  basis  polynomials  G>,> : 

G=  [Gi.oOO  Gij(r)  Gm(?)]T  (78) 

Also  here,  the  coefficients  in  the  vector  V  in  eq.  (77)  arc 
written  as  a  weighted  sum  of  the  given  discrete  values 
Qr  in  a  set  of  cells  in  the  neighbourhood  of  the  cell  P: 

V  =  W.Q  (79) 
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in  which 


The  solution  W  of  the  system  is  easily  obtained: 


^  (80) 

A  similar  reasoning  as  for  the  ZM-1IOR  algorithm  is  now 
followed  leading  to  the  next  set  of  systems  of  equation 
for  the  reconstruction  weights: 


W.D  =  A 

in  which  the  matrix  D  is  defined  as: 


D  = 

' 

■ 

G.riS  • 

••  ' 

°0  ,k 

.  <o 

eft  • 

•  •  cRn 

u0,fc 

(81) 


(82) 


with  =  <Ji,>(fn)  the  point  value  of  a  basis  polyno¬ 
mials  at  the  reference  point  of  a  support  cell  Ji.  Remark 
again  that  eq.  (81)  gives  the  following  equation  for  recon¬ 
structing  constant  basis  polynomial: 


52  W*  =0  for  0  <  t  +j  <  k  (83) 

R 

The  set  of  systems  (81)  is  solved  just  as  it  was  solved  in 
the  case  of  the  zero  mean  polynomials  while  adding  the 
constraint  of  minimal  norm. 

4.4  Example 

Suppose  one  wants  to  perform  a  linear  reconstruction  us¬ 
ing  the  GC-HOR  algorithm  >n  th-  cell  P  o'.‘  a  regular  or¬ 
thogonal  rectangular  mesh  with  mesh  size  A*  =  A'j  =  1, 
sec  f'g-  20.  The  reference  point  of  each  cell  is  taken 
at  the  gravity  center.  The  support  for  the  reconstruc¬ 
tion  is  chosen  to  contain  the  cell  P  itself  and  its  4  direct 
neighbours. 


Pig.  20:  Support  for  Linear  Reconstruction  on  a  Reg¬ 
ular  Mesh. 


The  matrix  D  in  eq.  (81)  becomes  then: 


1 

0 

0  ' 

1 

-l 

0 

1 

0 

1 

1 

i 

0 

1 

0 

-1 

(84) 


The  first  column  of  D  corresponds  to  eq.  (83),  the  second 
column  gives  the  coefficients  of  the  weights  for  the  term 
in  y  and  the  third  column  gives  the  coefficients  of  the 
weights  for  the  term  in  x.  The  first  row  is  related  to  cell  P 
while  row  j  is  related  the  neighbour  j  —  1  for  j  =  2, 3, 4, 5. 
The  right  hand  side  of  the  system  is  given  by: 


RHS  = 


0  10 
0  0  1 


(85) 


W  = 


0 

1/2 


1/2  0  ' 

0  -1/2 


(86) 


in  which  the  first  row  gives  the  weights  contributing  tv 
the  term  in  y  and  the  second  row  gives  the  weights  con¬ 
tributing  to  the  tetm  in  z. 

The  reconstruction  of  the  solution  Q  in  cell  P  can  then 
be  written  as; 


Q{p\x,y)  =  QP  +  9±^9>..  (x  .  xr) 

T--j— («7) 

From  which  it  is  found  that  the  gradient  of  Q ^  is  given 
by  the  classic  central  approximation  formulas. 

As  the  value  of  a  linear  function  in  the  gravity  center  of  a 
cell  is  also  equal  to  the  mean  value  of  the  linear  function 
over  the  cell,  it  can  be  shown  easily  that  the  ZM-HOR- 
algorithm  gives  the  same  reconstruction  weights  when 
performing  a  first  order  reconstruction. 

4.5  Caveat 

If  one  takes  a  support  as  shown  in  fig.  21  in  order  to  per¬ 
form  a  second  order  reconstruction,  then  one  finds  that 
all  coefficients  in  the  matrix  A  of  the  weights  contribut¬ 
ing  to  the  term  in  xy  arc  zero.  Note  that  always  either 
Ax  or  Ay  is  zero  for  each  of  the  elements  of  the  support 
depicted  in  fig.  21. 


Fig.  21:  Erratic  Support  for  Quadratic  Reconstruc¬ 
tion. 

The  product  of  both  will  therefore  always  be  zero,  see 
also  eq.  (81)  with  s  —  i  =  1.  As  in  the  RJIS  one  finds 
a  unit  block  matrix,  the  equation  with  i  =  j  =  1  will 
be  contradictory  since  it  states  that  0  should  equal  1. 
Therefore,  second  order  polynomial  reconstruction  on  a 
regular  rectangular  mesh  always  requires  taking  elements 
along  the  diagonal  of  cell  P  into  the  support. 

Note  that  this  problem  also  occurs  when  using  the  ZM- 
HOR-algorithm.  The  mean  of  Fi,<  (x,y)  will  be  zero  over 
each  of  the  support  cells  since  F\,i  is  an  uneven  function 
being  integrated  over  a  domain  with  symmetric  bound¬ 
aries. 

In  general,  one  can  say  that  a  contradictory  system  is 
expected  if  there  are  not  enough  cells  in  all  space  di¬ 
rections.  A  contradictory  system  is  avoided  if,  see  also 
eqs.  (49), (64), (81): 

rank  IT  =  m  or  rank  DT  =  m  (88) 

where  m  is  given  by  eq.  (43).  The  condition  can  be  satis¬ 
fied  by  choosing  an  oversized  stencil  which  obviously  in¬ 
creases  the  computational  load.  In  this  work,  the  stencil 
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is  in  general  oversized  by  2.  In  ref.  [16],  Marten  proposes 
other  techniques  for  selecting  the  support  in  order  to 
obtain  a  sufficient  number  of  linearly  independent  equa¬ 
tions.  However,  these  techniques  arc  all  strongly  finked 
with  the  chosen  reconstruction  algorithm. 

4.6  Accuracy 

Theorem  4.1 

If  a  given  solution  function  Q  varies  smoothly  ami  ifQ  is 
specified  as  a  discrete  set  of  cell  averages  in  each  cell,  then 
the  ZM-II OR- Algorithm  given  by  eg.  (5f)  approximates  a 
given  solution  function  Q  to  order  hk+>  with  k  the  degree 
of  the  reconstruction  polynomials. 

Proof  of  Theorem  4.1: 

As  Q  varies  smoothly,  one  can  then  average  the  Taylor 
scries  expansion  as  given  in  cq.  (72)  using  the  definitions 
in  eq.  (68)  and  (69).  Inserting  the  expression  of  cq.  (72) 
for  Qh  in  cq.  (54)  yields: 

Q^\r)  =  Qr 

•+r<«<  *  ,  /  \ 

+  E 

i+j>o  1=1  '  \  n  / 

+  0(h*+*).’P£M 

Let-  us  now  define  the  vector  F|  as: 

Fj=[F|i0  •••  •••  F,),j]  (89) 

i.c.  the  vector  containing  all  basis  functions  of  exactly 
degree  I. 

Using  cq.  (64)  and  expanding  also  Qp  while  truncating 
the  series  at  1  =  k,  one  gets: 


i=i  *■ 

where  ‘Pq+l  stands  for  space  derivatives  of  order  Is  +  1  of 
Q  and  A  typical  length  associated  with  cell  P\  e.g. 
h  =  y/Hp.  The  m,h  element  of  the  vector  Cf  is  given 
by: 

C,1  m,m  =  (i-*p)'-n,.(y-yrr  (90) 

Remark  then  also  that  due  to  eq.  (30): 

V,  =  C,  -  l\'  P  (91) 

The  reconstruction  polynomial  reads  then: 

Q\’\n  =  Qr  +  J2  icf.3Qf  +  0(fc*f2)  V%' 

i=i  *' 

As  the  Taylor  series  expansion  of  Q(f)  around  the  refer¬ 
ence  point  rp  is  given  by: 


k  i 

*■  Q»  +  E  Ji  E  Ct-m 

le I  ’  maO 

+  0(Ak+,).Py+'  ■ 

=  Q,>  +  J2T,cl,'.dQ,,’  +  0{hk+,).vkj 1 

one  finuj  that: 

<?<;>(f)  =  Q(f)  -i  0(A*+1).2>£+1  (92) 

C) 

A  similar  reasoning  for  Barth’s  version  of  the  ZM- If  011- 
Algorithm  leads  tc  the  next  theorem: 

Theorem  4.2 

//  a  given  solution  function  Q  varies  smoothly  and  if  Q  is 
specified  as  a  discrete  set  of  cell  averages  in  each  cell,  then 
the  ZM-II Oil- Algorithm  given  by  cq.  ($4)  approximates  a 
given  solution  function  Q  to  order  Alt+ 1  with  k  the  degree 
of  the  reconstruction  polynomials. 

We  will  now  proof  the  following  theorem  concerning  the 
accuracy  of  the  GC-IlOR-Algorithm: 

Theorem  4.3 

If  n  given  solution  function  Q  varies  smoothly  and  if  Q  is 
specified  as  a  discrete  set  of  cell  averages  in  each  cell,  then 
the  OC-HOli- Algorithm  given  by  cq.  (77)  approximates  a 
given  solution  function  Q  to  order  hk  { '  with  k  the  degree 
of  the  reconstruction  polynomials. 

Proof  of  Theorem  4-3: 

In  a  similar  way  as  for  the  ZM-HOK-Algoritlim,  one  per¬ 
forms  a  Taylor  series  expansion  around  the  reference  rp  of 
Qn  in  the  expression  for  Vit)  in  eq.  (77).  Using  the  prop¬ 
erties  of  the  weights  IF,"  expressed  by  cq.  (81)  and  (83), 
one  finds  again: 

Q(p\f)  =  Q(f)  +  «(A*+,).7>*+’  (93) 

□ 

4.7  Results 

In  the  following,  some  reconstructions  of  known  polyno¬ 
mials  using  the  GO-HOR-algorithm  arc  shown.  Similar 
results  were  obtained  with  the  ZM-lfOR-algorithm.  A 
numerical  error  study  on  the  higher  order  reconstruction 
algorithm  was  carried  out  whereby  the  error  between  the 
reconstruction  and  the  exactly  known  polynomial  is  ex¬ 
amined.  The  next  polynomials  were  rather  arbitrarily 
selected  to  perform  the  numerical  error  study: 


Pi(*>y)  - 

2*  +  3y  +  2 

I’tlx.y)  - 

+  zy 

-  y  +  f- 

Fb(*,y)  = 

Ml  t  ny  + 

-  -  \xy  +  10 

F«(*,y)  = 

x*  +  16y4  +  8*y3 

—  34*  -f  10 

Figures  22,  23,  24  and  25  show  an  isovalue  plot  of  the 
four  polynomials  on  a  given  rectangular  domain. 

Three  different  meshes  were  used  on  the  domain  to  re¬ 
construct  the  previous  polynomials  using  the  Non-Zero 
Mean  HOR-algorithm  while  taking  the  gravity  center  as 
a  reference  point:  a)  a  regular  mesh  (fig.  26a),  b)  as  a) 
but  twice  as  fine  (fig.  26b)  and  c)  an  irregular  mesh  with 


Fig.  22:  Isolines  of  Test  Polynomial  PI  for  the  GC-HOR- Algorithm. 


Fig.  23:  Isolincs  of  Test  Polynomial  P2  for  the  GC-HOR-Algorithin. 


the  same  number  of  points  as  in  a)  but  they  arc  moved 
randomly  (fig.  26c). 

The  values  of  the  reconstructed  polynomials  were  then 
compared  with  the  exact  values  of  the  polynomials.  The 
graphs  27a  and  27b  show  the  //j-norm  of  the  error  made 
all  over  the  domain  as  a  function  of  the  order  of  the  recon¬ 
struction  and  this  for  each  of  the  four  given  polynomials. 
The  /.  j  norm  is  given  by: 


£*..rr  =  \jjjn  «#*  ~  <?)*■«*«  <95) 

in  which  Q p*  is  the  reconstructed  function,  Q  the  given 
polynomial  function.  Fq.  (95)  is  discretized  by  cover¬ 
ing  the  domain  with  a  fine  point  cloud.  This  cloud  is 
triangulated  and  the  error  is  assumed  to  vary  linearly  in 
each  triangle.  The  integral  over  the  whole  domain  is  then 
the  sum  of  the  integrals  over  each  triangle.  As  about  13 
points  were  taken  in  each  grid  cell,  the  discretization  er¬ 
ror  will  be  relatively  small  (typically  0,1  %). 

Comparing  the  /,;-norms  in  fig.  27a  for  the  coarse  and  the 
fine  mesh,  one  can  conclude  that  the  reconstruction  error 
is,  as  expected,  roughly  of  order  G(fc*+'),  if  r  >  k.  This 
dependency  on  the  grid  spacing  is  valid  for  all  polynomial 
degrees  r  larger  than  k.  Note  also  that  the  error  is  hardly 
affected  by  the  grid  irregularity,  as  can  be  deduced  form 
figs.  27a  and  27b.  From  these  two  figures,  it  also  obvious 
that  the  reconstruction  error  is  indeed  increasing  for  a 
given  k  when  r  is  increasing.  . 

In  order  to  see  how  the  errors  ate  distributed  over  the 
domain,  the  relative  to-norm  of  the  difference  between 
the  two  functions  over  each  individual  cell  is  computed 
as  follows: 


This  equation  is  discretized  in  exactly  the  same  as  cq.  (95) 
but  now  only  over  each  cell  individually. 

Making  a  plot  of  the  relative  to-r.orni  of  the  error  would 
only  produce  random  noise  if  r  <  Jfc.  Therefore,  it  makes 
only  sense  to  examine  the  error  distribution  over  the  do¬ 
main  when  r  >  k.  Figures  28a,  28b  and  28c  show  the 
relative  error  distributions  for  the  polynomials  P2,  P3 
am!  J'-i  when  using  a  fust  order  reconstruction  on  the 
irregular  mesh.  It  is  observer!  that  the  largest  errors  oc¬ 
cur  where  the  isolincs  of  the  polynomial  exhibit  a  high 
curvature  or  where  the  distance  between  the  isolincs  is 
varying  strongly.  The  effect  of  the  terms  of  higher  degree 
in  the  polynomials  Pi,  P»  and  P«  is  stronger  in  these  loca¬ 
tions;  therefore,  the  linear  reconstruction  cannot  match 
anymore  with  the  imposed  polynomial  and  the  error  in¬ 
creases.  The  same  behaviour  is  seen  on  the  two  other 
meshes  and  for  the  higher  order  reconstructions  as  long 
as  r  >  Jt. 

Finally,  fig.  29a,  29b  and  29c  present  the  linear,  quadratic 
and  respectively  the  cubic  reconstruction  of  the  function 
P. j  on  an  irregular  mesh.  These  figures  demonstrate  the 
increasing  quality  of  the  reconstruction  with  increasing 
reconstruction  order.  Again,  an  exact  match  is  found 
when  the  order  of  the  reconstruction  equals  the  degree  of 
the  polynomial. 

4.8  Remark 

Solution  reconstruction  is  just  a  way  of  interpolating  a 
given  set  of  discrete  values.  Hence,  it  is  clear  that,  besides 
polynomial  basis  functions,  other  basis  functions  can  be 


Fig.  24:  Isolincs  of  Tost  Polynomial  P3  for  the  GC-llOR-Algorithm. 


Fig  25:  Isolines  of  Tost  Polynomial  Pt  for  the  GC-HOR- Algorithm. 


used  for  solution  reconstniction  such  as  trigonometric, 
hyperbolic  or  Bessel  functions.  Unlike  in  spectral  meth¬ 
ods,  the  interpolants  need  only  to  be  cell  wise  continuous; 
i.e.  discontinuities  ate  allowed  across  cell  interfaces. 
Another  interesting  interpolation  technique  was  proposed 
by  Do.  Meyer  ci  al.  ['28,  29],  whereby  a  mixture  of  trigono¬ 
metric  and  polynomial  functions  is  used.  Research  is  still 
to  be  done  to  generalize  this  technique  to  finite  volume 
schemes  in  multiple  dimensions. 


Fig.  27a:  Graph  of  /-a-norm  of  the  Reconstruc-  Fig.  27b:  Graph  of  Li-  norm  of  the  Recon- 

tion  Error,  Regular  and  llclincd  Mesh  struction  Error,  Irregular  Mesh  (G0- 

(GC-HOR).  IIOR). 


Fig.  28a:  Error  Distribution  for  1**  Order  Reconstruction  of  Pi  (GC-HOR), 
cellwiee  /,2-norms. 


Fig.  28b:  Error  Distribution  for  1*'  Order  Reconstruction  of  P3  (GC-HGR), 
cellwise  f*3-norms. 
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5  HOR-SCHEMES 


Eq.  (98)  will  be  solved  exactly  if  the  next  equation  is 
satisfied  VP  €  ft: 


This  section  discusses  2D  upwind  finite  volume  schemes 
applying  higher  order  polynomial  solution  or  flux  recon¬ 
struction  for  linear  and  nonlinear  convection  problems 
with  a  source  term.  As  yet,  no  attention  will  be  paid 
to  the  rnonotonicity  of  the  schemes  nor  will  stability  be 
taken  into  consideration. 


ffn  Tt  dU  +  fm  r'-Sq  ds  =  Jfn  0°°) 

Using  the  results  of  approximate  Ricmann  solvers,  an  up¬ 
wind  space  discretization  of  the  left  hand  side  of  eq.  (100) 
with  a  first  order  accurate  time  discretization  is  obtained 
and  the  scheme  becomes  (after  division  by  ftp): 


5.1  Linear  Problem 


+  [*>(*?«)] (Qn)  =  WS)](Qn)  (101) 

with 


Consider  the  2D  linear  convection  equation  with  source 
term  to  be  solved  in  a  domain  ft: 


9)]  (Q)  = 

Xv  zC  (‘pr^po  +  aPR-Q(Ro)  Asc(102) 

*  R  O 


=  S(q,r ) 


(97) 


the  space  discretization,  D(S)  an  as  yet  undefined  dis¬ 
cretization  of  the  source  term  S  and 


°pr  =  max  (3.8,0) 

aPR  ~  m'n  («'8, 0) 
from  which  it  follows  that 


(103) 


where  a  is  a  constant  convection  vector  and  q  the  solu¬ 
tion  of  the  analytical  problem.  Integration  over  ft  gives: 


aPR  +  aPR  ~  “  H  • 

Remember  that  o  is  a  constant  vector.  is  the  mean  of 
the  discrete  solution  distribution  over  the  cell  P  at  time 
level  n. 


SL 


.dft  + 


(98) 


Discretizing  the  domain  by  partitioning  it  into  smaller 
cells  P  of  arbitrary  polygonal  shape  and  applying  Green’s 
theorem  to  the  second  term  in  the  left  hand  side,  eq.  (98) 
can  be  written  as: 


dq 

dt 


dft  + 


n.a.q.ds 


Fig-  30:  Gauss  points  O  on  the  edge  between  the  cells 
P  and  R. 

The  value  of  Q^po  and  **  the  value  of  k,h  order 
polynomial  reconstruction  at  the  Gauss  points  0  on  the 
cell  interface  between  cell  P  and  its  neighbour  R,  seen 
from  cell  P  resp.  cell  R  (see  also  eq.  (54)  and  fig.  30): 


-  S(q,  f).<fft  (99) 


whereby  ftp  =  ft  and  n  is  the  outward  unit  normal 
vector  at  the  edges  of  the  cell  P  while  the  contour  integra¬ 
tion  has  to  be  carried  out  in  counter-clockwise  direction. 


Qpo 

—  Qp  + 

K), 

F P(?o) 

(104) 

eft 

=  <Tr  + 

F R(?o) 

(105) 

or  following  eq. 

(77): 

Wro 

=  Qp  + 

(W.Q 

■GP(?o) 

(106) 

of*) 

**/JO 

=  Qr  + 

(w.q")k 

G  Vo) 

(107) 
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in  which  Fp  and  Fn  ue  the  vectors  containing  the  zero 
mean  basis  polynomials  related  to  the  reference  points  of 
cell  P  resp.  R: 

f£(0  =  (*~*pY  (v-vrY  ~itf 

F&n  =  (x-TH)'.(y-VRy  -  I%R 

.  0  <»  +  ><* 

and  where  G^  and  GR  are  the  vectors  containing  the 
basis  polynomials  for  the  GC-HOR-Algorithm  related  to 
the  reference  points  of  cell  P  resp.  R: 

0Pj (f)  =  (r  -  xp)‘.(y  -  yp)’ 

<3*(0  =  (i  -*«)*•(»  -vr)3 
,  0  <»-§-><  k 

Sec  also  eqs.  (30),  (33)  and  (76).  Note  also  that  the  cell  P 
and  its  direct  neighbours  R  have  in  general  a  different 
support. 

Theorem  5.1 

The  space  discretization  of  the  scheme  given  by  eq.  (lOt ) 
and  cq.  (102)  is  order  hk  accurate  on  any  type  of  mesh 
wherever  the  solution  is  smooth: 

[D  (fl.fy)]  (<l)  =  <  S.^q  >P  +0(A*)Dj+1  (108) 

Note  that  in  eq.  (108)  the  symbol  q  denotes  the  solution 
of  the  exact  solution  of  the  (analytical)  equivalent  dif¬ 
ferential  equation  [30,  12],  whereas  in  the  following,  the 
symbol  Q  stands  for  the  discrete  solution  function  at  a 
given  moment  during  the  computation. 

Two  proofs  for  this  theorem  are  given  in  appendix  A.  A 
first  proof  is  based  on  eqs.  (92)  and  (93)  stating  that  the 
polynomial  reconstruction  approximates  a  solution  q(f) 
to  order  The  second  proof  uses  a  truncation 

error  analysis  based  on  the  concept  of  the  equivalent  dif¬ 
ferential  equation,  see  also  [30,  12].  The  choice  of  0(5) 
will  follow  from  each  of  the  proofs,  see  subsection  5.4. 

For  the  time  being,  no  attention  will  be  paid  to  the  time 
discretization  which  nevertheless  still  has  an  effect  on  the 
space  discretization  as  well. 

5.1.1  Polynomial  Preservation 

Let  us  now  investigate  what  happens  to  an  exact  poly¬ 
nomial  solution  q  of  degree  r  satisfying  the  steady  state 
of  cq.  (97)  in  ft  when  passing  it  into  the  scheme  given 
by  eq.  (101).  If  the  degree  of  the  polynomial  solution 
does  not  exceed  the  order  of  the  reconstruction  k,  then 
Q^Pq  =  Q^po  =  qo  and  eq.  (101)  reduces  to: 

^  E  E  -to-A»o  =  d(s) 

’  P  R  O 

If  the  number  ot  Gauss  points  on  each  edge  is  larger  than 
or  equal  to  one  finds,  after  applying  the  Gauss  the¬ 
orem: 

As  q  satisfies  S.tfq  =  S(q,f)  everywhere  in  ftp,  this  can 
be  further  reduced  to: 


or 

=  0(5)-  <  S(q,  F)  >r  (109) 

The  right  hand  side  of  eq.  (109)  will  he  zero  if  0(S)  is 
an  exact  discretization  of  cell  average  of  the  source  term. 
This  then  means  that  the  time  change  of  the  polynomial 
steady  state  solution  is  zero.  In  other  words,  an  exact 
polynomial  steady  state  solution  of  degree  r  is  no  longer 
modified  by  the  scheme  (101)  if  the  the  order  of  the  so¬ 
lution  reconstruction  is  at  least  equal  to  r. 

Note  that,  in  the  general  case,  the  difference  between 
0(5)  and  <  S(q,  f)  >p  is  given  by  eq.  (121)  showing 
that  the  difference  is  not  zero.  It  will  be  zero  if  the  source 
term  can  be  averaged  exactly.  This  leads  to  the  follow¬ 
ing  theorem  linking  polynomial  preservation  with  higher 
order  space  accuracy: 

Theorem  5.2 

7  a  scheme  is  polynomial  preserving  up  to  degree  k  and 
if  its  source  term  discretization  is  of  order  hk  in  space, 
then  the  scheme  is  of  order  hk  in  space. 

Proof  of  Theorem  5.2: 

Suppose  that  the  scheme  is  lower  order  accurate: 

D(a.$q)  =  <  aJq  >P  +  0(h”) 2>£+1 

with  p  <  k.  Note  also  that,  in  the  equivalent  differential 
equation  of  a  scheme,  a  derivative  of  order  p+ 1  can  never 
have  a  coefficient  which  is  larger  than  something  of  order 
hp.  Hence,  terms  like  0(hp)T> J+1  with  p  <  k  can  never 
appear. 

Plugging  an  exact  polynomial  solution  q  of  degree  k  in 
the  scheme  of  eq.  (101)  would  lead  to: 

+  b  //„>*«) M + 

=  O(S) 

As  it  is  supposed  here  that  the  source  term  is  discretized 
with  a  space  accuracy  of  0(hk)  with  k  <  p,  this  becomes: 

Qp  -A~  Qp~  =  C>(hp)Vp+'  0 

This  is  in  contradiction  with  the  starting  hypothesis  that 
the  scheme  is  polynomial  preserving.  So,  polynomial 
preservation  up  to  degree  k  implies  k‘h  order  space  accu¬ 
racy  if  the  source  term  is  discretized  exactly. 

D 

5.2  Nonlinear  Problem 

Remind  first  that  q  stands  for  the  exact  solution  of  a 
given  analytical  problem  whereas  Q  denotes  the  discrete 
solution  at  a  certain  moment  during  a  computation. 

Let  the  following  2D  nonlinear  convection  equation  with 
source  term  5  to  be  solved  in  a  domain  ft: 

^+T).P(q,r)  =  S(q,f)  (110) 

where  P  is  some  flux  vector.  Integration  of  eq.  (110) 
over  ft  gives: 
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II  Pa + 1 L  * m - / L s(* f)- "  i} 

Proceeding  as  for  the  linear  problem,  eq.  (Ill)  can  be 
written  as: 


?/l 


^.dft  +  V/  F(q,r).n.ds 

o  JO ftp 


-  ?//».. 


JOUp 


(112) 


whereby  ftp  =  ft  and  n  is  the  outward  unit  normal 
vector  at  the  edges  of  the  cell  P  while  the  contour  integra¬ 
tion  has  to  be  carried  out  in  counter-clockwise  direction. 
Eq.  (Ill)  will  be  solved  exactly  if  the  next  equation  is 
satisfied  'iP  6  ft: 


+ 


JIj 


S(4,f).dft(m) 


The  flux  function  Il(q,f,n)  is  now  defined  as  I  (q,r). n 
and  can  be  evaluated  based  on  the  concepts  of  ID  pro¬ 
jected  Riemann  solvers  leading  to  an  upwind  space  dis¬ 
cretization  of  the  left  hand  side  of  cq.  (113)  with  a  first 
order  accurate  tine  discretization  as  given  in  the  follow¬ 
ing  equation  (after  division  by  ftp): 


Qf+'aT~  +  W")  =  W)]W“)  ("4) 

with 

\d&.P)\  (Q)  =  E  E  *2*  Aso  <115> 

the  space  discretization,  D(S)  an  as  yet  undefined  dis¬ 
cretization  of  the  source  term  S  and  II  rn  a  numerical 
flux  vector  obtained  using  a  ID  approximate  Riemann 
solver.  The  latter  can  be  cither  a  flux  vector  splitter 

(FVS): 

fl?R  =  H*(Q%,Fo,npH)+  (H6) 

or  a  flux  difference  splitter  (Roe-type  splitter,  [19]): 


difference  splitting  schemes.  Note  that  this  will  only  be 
valid  where  H ,  11* ,  II  '  and  Q  are  f.nooth  functions.  As 
for  the  linear  problem,  no  attention  will  be  paid  to  the 
time  discretization  which  nevertheless  still  has  an  effect 
on  the  space  discretization.  - 


5.2.1  FVS-HOR-Scheme 
Theorem  5.3 

The  space  discretization  of  the  scheme  given  by  eq.  (114) 
and  eq.  (116)  is  order  hk  accurate  on  any  type  of  mesh 
wherever  the  solution  is  smooth: 

[i>(tf./,)J  (?)  =  <  V.P  >p  +0(hk)D?iVl„  (118) 


Proof  of  Theorem  5.3: 

To  proof  order  hk  space  accuracy,  one  studies  the  trun¬ 
cation  error  by  deriving  the  equivalent  differential  equa¬ 
tion  using  local  Taylor  series  expansions.  Using  eqs.  (92) 
or  (93)  and  if  the  exact  solution  q  of  the  equivalent  differ¬ 
ential  equation  varies  smoothly,  scheme  (114)  with  (116) 
can  then  ho  rewritten  as  (when  carrying  out  a  Taylor 
series  development  of  II  pR  and  II RR  around  go): 


[d(^./,>]  (?)  = 

a-.  EE  [^r(9o)+0(^+,)i>:+,i>)/+ 

F  R  O 

+  II?R(qo)  +  0(Ak+,)T>5+lX>^_]  .Aso 

=  II  (qo,r°,iii>n).&3o 

R  O 

+<>(/.  W  z>U- 

P  R  O 


+  E  jjCfirobmf 

la  1 

+a(hk)vk+'v}, 


■A  so 


where  the  definition  of  Cf  is  given  in  cq.  (90)  while  the 
one  of  9Hf  is  similar  to  the  one  found  in  cq.  (69). 


i  [f/  ( Q(pl,fo,npR )  +  H  (Qro^o^Tph)] 

-  -  \Cm(Qm,  HPR)\ .  (<?&  -  <?(po)  0»7) 

where  the  value  of  Qp^  and  Q{Rq  is  the  value  of  k,h  order 
polynomial  reconstruction  at  the  Gauss  points  O  on  the 
cell  interface  between  cell  P  and  its  neighbour  R,  seen 
from  cell  P  re»jp.  cell  R,  see  also  eq.  (104),  (105)  ,  (106) 
and  (107).  The  symbol  Cm  stands  for  the  Jacobian 

taken  at  the  Roe-averaged  state  Qm  (Qpo>Qjio)  •  Note 

also  that  for  both  the  eqs.  (116)  and  (117)  the  following 
is  valid: 

flpR(Q,Q,r)  =  H(Q,f,nPR) 

It  will  now  be  shown  that  the  space  discretization  of  the 
scheme  given  by  eq.  (114)  and  eq.  (115)  is  order  hk  ac¬ 
curate  on  any  type  of  mesh  for  both  flux  vector  and  flux 


The  previous  expression  for  the  space  discretization  can 
be  rewritten  as  a  continuous  contour  integral  if  the  num¬ 
ber  of  Gauss  points  O  on  the  edges  is  larger  than  or  equal 
to 

[d(*.P)  ]  (?)  = 

±-.f^  [ll{q,r,n)  +  0(kk^)].ds 

+C(hk)Pk+'V), 

=  <  >p  +c>(A*)p$+,  r>« 

which  completes  the  proof  of  kth  order  space  discretiza¬ 
tion.  Note  also  that  one  is  discretizing  mean  values  over 
a  cell  even  when  the  GC-HOR-Algorithm  is  used.  Point 
values  are  discretized  only  if  the  order  of  the  reconstruc¬ 
tion  fc  <  1. 

G 
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5.2.2  FDS-HOR-Scherae 
Theorem  5.4 

The  space  discretization  of  the  scheme  given  by  eg.  (tif) 
and  eq.  (it 7)  is  order  hk  accurate  on  any  type,  of  mesh 
wherever  the  solution  is  smooth : 

[£>(? .Z1)]  (q)  =  <  V.P  >P  +0(hk)Vk+'Vl„1>lA  (119) 
Proof  of  Theorem  5.4: 

Again  using  eqs.  (92)  and  (93)  and  if  g  varies  smoothly, 
scheme  (114)  with  (117)  can  be  rewritten  as  (when  car¬ 
rying  out  a  Taylor  series  development  of  II,  q  and  Cm 
around  go)- 


D(9.P)  ~ 

A 

207 


££{"<•» 

R  O 


+0(1 i‘+,)Pj+1Z>), 

+  h(qo,ro,npR)  +  0(hk+i)T>kq+'l>tH 
+  [Cm(qo,qo) 

-+-C»(/»fc+1  )X>J,]  £>(**'-*' 1  )T>J+1 }  .A*0 


=  H (?o. ro,  npn)  Asp 


4-C7(h*)27j+,P}, 

=  orf/EE^  («o,fo.n/*R)].Aso 
r  h  o 

+0(hk)Vk"V),ViA 


From  here  on,  the  same  reasoning  is  followed  as  for  the 
FVS-HOR-Schemc  whereby  it  is  required  that  the  num¬ 
ber  of  Gauss  points  O  on  the  edges  is  larger  than  or  equal 
to  This  brings  one  then  to  the  conclusion  that  FDS- 
HOR-Schcmes  arc  kth  order  space  accurate  on  any  type  of 
mesh  even  if  considerable  grid  irregularities  arc  present. 

O 


5.2.3  Polynomial  Preservation 

As  with  the  linear  problem,  let  us  now  investigate  what 
hai  pens  to  an  exact  polynomial  solution  q  of  degree  r 
satisfying  the  steady  state  of  eq.  (110)  in  12  when  passing 
it  in  the  scheme  given  by  eq.  (114).  If  the  degree  of 
the  polynomial  solution  does  not  exceed  the  order  of  the 
reconstruction  k,  then  Qpo  =  Qro  =  ?o  and  eq.  (114) 
reduces  to: 


gP*  ~  <Fr  - 

At 

-  7r~  E  E  ^  (jo.ro,  fw-n)  A  So  +  D(S) 

P  R  O 

R  O 


+  E  ^cf(?o).5 Hf 

Im  I  * 

+0(hr)VrJ' 


A  so  +  D(S) 


If  the  number  of  Gauss  points  on  each  edge  is  larger  than 
or  equal  to  one  finds,  after  applying  the  Gauss  the¬ 
orem: 


^-^-•^-1 V.P.dn  =  D(S)  F 0(hT)Ptfl 

from  which  it  follows  that  nonlinear  HOR-Schemes  arc 
only  polynomial  preserving  if  higher  order  derivatives  of 
II  w.r.t.  to  q  are  zero  and  if  there  is  a  sufficient  num¬ 
ber  of  Gauss  points.  As  for  the  linear  problem  D(S ) 
should  be  an  exact  discretization  of  the  source  term.  Note 
that,  in  the  general  case,  the  difference  between  D(S)  and 
<  S(q,  r)  >p  is  given  by  eq.  (121)  showing  that  the  dif¬ 
ference  is  not  zero.  It  will  be  zero  if  the  source  term  can 
be  averaged  exactly.  In  that  case,  it  will  be  shown  here 
that  if  a  scheme  is  polynomial  preserving  up  to  degree  k 
then  it  is  of  order  hk  in  space. 

5.2.4  Flux  Reconstruction 

For  nonlinear  problems  it  seems  logic  to  reconstruct  the 
fluxes  rather  than  the  solution  variables  themselves,  in 
particular  when  ENO-reconstruction  is  used,  see  sec¬ 
tion  6.  It  is  expected  to  achieve  a  better  control  over 
the  monotonicity  of  ii.e  scheme. 

Shu  and  Oshcr  (31,  14]  devised  indeed  a  Lax-Fricdrichs 
type  of  scheme  where  the  numerical  flux 

F(Q)  =  F(Q)  +  aQ 

is  reconstructed  using  higher  degree  polynomials.  They 
presented  interesting  results  in  two  space  dimensions. 

A  slightly  different  approach  was  brought  up  by  Har- 
ten  [16]  whereby  the  averages  of  the  reconstructed  solu¬ 
tion  Qreco  and  the  reconstruction  of  the  projected  flux 

vector  (P-Spr)  =  HTpn°  arc  evaluated  over  each 
cell  edge  using  an  app-opriatc  Gauss  quadrature  for¬ 
mula.  These  edge-averaged  values  available  from  both 
sides  of  the  edge  arc  then  used  to  solve  a  single  Riemann 
problem  using  a  Roe-type  approximate  Riemann  solver 
rather  than  the  number  of  times  as  would  be  required  by 
the  quadrature  formula.  This  option  depends  of  course 
strongly  on  the  computational  cost  of  the  approximate 
Riemann  solver. 

Hailen  proposes  two  ways  of  reconstructing  the  fluxes. 
A  first  one  consists  of  writing  the  physical  flux  vector 
as  an  analytical  function  of  the  solution  reconstruction 
polynomial.  This  function  is  then  rewritten  as  a  kind  of 
truncated  Taylor  expansion  whereby  analytical  expres¬ 
sions  for  the  space  derivatives  are  used.  The  coefficients 
in  this  expansion  arc  such  that  the  reconstruction  con¬ 
serves  the  mean.  A  second  method  uses  a  gravity  center 
reconstruction  whereby  the  values  of  the  physical  flux 
vector  at  the  cell  centroids  are  evaluated  using  the  value 
of  the  zero- mean  solution  reconstruction  at  those  cen¬ 
troids.  The  first  flux  reconstruction  method  seems  to 
be  most  appropriate  for  unstructured  grid  solvers  imple¬ 
mented  on  parallel  computers.  The  second  method  has  a 
lower  computational  cost  but  requires  more  storage  and 
results  in  more  inter- processor  communication. 


5.3  Discussion 

From  eq.  (108),  (118)  and  (119)  it  follows  that  the 
schemes  (101)  and  (114)  can  become  inconsistent  when 
piecewise  constant  solution  reconstruction  is  used.  In 
practice,  piecewise  constant  solution  reconstruction  al¬ 
ways  gives  reasonable  results  even  on  irregular  meshes 
which  indicates  that  the  scheme  is  at  least  first  order. 
LeVeque  [32]  showed  in  ID  that  first  order  accuracy  is 
indeed  achieved  on  irregular  meshes.  The  proof  in  2D  is 
presented  for  a  linear  scheme  in  appendix  B  and  depends 


on  the  time  discretization  and  the  fact  that  the  scheme 
is  stable  for  CFL-mi inbets  which  arc  of  order  h. 

Eq.  (108),  (118)  and  (119)  indicate  that  the  truncation 
error  of  the  space  discretization  is  at  least  of  order  hh, 
which  does  not  exclude  that  the  complete  scheme  can 
be  of  order  Ak+1  not  only  on  regular  meshes  but  also 
on  irregular  meshes.  Actually,  the  choice  of  the  time 
discretization  can  be  such  that  its  truncation  error  com¬ 
pensates  the  terms  of  order  hk  in  the  space  discretization 
truncation  error  to  order  .  LeVcquc  made  some  sug¬ 
gestions  in  this  direction,  while  Shu  [31]  used  a  Runge- 
Kutta  type  of  explicit  time  stepping  whereby  the  coeffi¬ 
cients  where  optimized  for  space  accuracy  and  stability. 
Further  comments  on  time  integration  will  be  given  in 
section  5.5. 

Remark  also  that  no  lower  order  derivatives  are  present  in 
the  truncation  error  given  by  cq.  (108),  (118)  and  (119). 
In  other  words,  terms  of  the  form: 

0(hr)V%  with  r  <  Jfc  and  p  <  r 


Proof  of  Theorem  5.5: 

One  assumes  that  the  exact  averages  of  a  smooth  exact 
solution  q  are  given  in  each  cell  P.  If  aow  also  S  is  a 
smooth  function  of  q  then  from  eq.  (122)  and  cq.  (92) 
and  after  performing  a  Taylor  series  development  of  S 
around  q  it  follows  that: 

S‘(f)  =  S(,,F)+0(hk+')l>  |  (123) 

This  leads  now  to  the  following  relation  between  higher 
order  derivatives  of  S  and  5* : 

ds;p  =  asf  +  o(h(k+,)"')i>|  (124) 

Inserting  cq.  (123)  and  (124)  in  cq.  (121),  one  finds: 
[D(S))(q)  =  S[q(  fHr>] 


1*1 


=  <5(«,0  >*’  +<>(*.*+')  (125) 


do  not  occur.  This  guarantees  that  the  scheme  has  no 
second  ,■  ,der  diffusive  errors  if  the  order  of  the  solution 
reconstruction  is  larger  than  2.  If  the  order  of  reconstruc¬ 
tion  is  1,  the  scheme  might  be  second  order  accurate  (see 
LeVequc)  but  still  have  diffusive  errors.  This  implies  the 
presence  of  the  following  term  in  the  truncation  error: 

0(h3)V*  whereby  k  =  1 

This  idea  can  be  generalized  to  higher  order  stating  that 
a  scheme  with  an  0(hk)  space  operator  might  become 
0(Ak+I)  due  to  the  time  discretization  while  derivatives 
of  order  fc+ 1  remain  present  in  the  truncation  error  which 
becomes  then  of  the  following  shape: 


0(hk+1  )Dk" 


5.4  Source  Term 

Eq.  (108),  (118)  and  (118)  indicate  that  the  discretiza¬ 
tion  of  the  source  term  S{q,  F)  must  satisfy  the  following 
equation  when  0(hk  )  space  accuracy  is  to  lie  obtained: 


[*>($)](«)  =  <$(?,  •')>'’  +  0(hp)  with  p  >  k  (120) 

If  5  is  a  known  function  of  r  and  linear  in  q  then  1){S)  is 
taken  as  the  exact  mean  of  the  function  over  the  cell  P 
with  q  taken  in  the  gravity  center.  In  all  other  cases  the 
following  discretization  is  proposed: 


[D(S)](Q)  =  S  [Q^(r»,r>]  +  £  Jlf  P  P 

(121) 

in  which 

S*(F)  =  5[Q(pk)(r),rj  (122) 

and  where  the  vector  dS*'p  is  defined  in  a  similar  way 
as  in  cq.  (69).  The  derivatives  in  this  vector  take  known 
expressions.  The  following  theorem  is  going  to  be  proved: 

Theorem  5.5 

The  source  term  discretization  given  by  eq.  (121 )  satisfies 
the  condition  (120)  for  higher  order  space  accuracy. 


a 

Remark  that  a  simpler  source  term  discretization  can  be 
used  in  combination  with  the  GC-HOR- Algorithm: 


[/;(.<?)]  (<?)-S(Qz.,r» 

•+><* 

+  £  <126) 

.+J>0 

i.c.  the  reconstruction  of  the  source  term  using  the  point 
values  at  the  reference  points  of  the  cells.  The  symbol  Q 
still  denotes  a  discrete  solution  at  a  certain  moment  dur¬ 
ing  the  computation.  The  discretization  (126)  satisfies 
also  condition  (120)  as  long  as  S  and  the  exact  solution  q 
are  smooth  functions.  To  proof  this,  it  is  sufficient  to 
let  D(S)  act  upon  the  exact  solution  q  and  to  perform 
a  Taylor  series  development  of  S(qrc ,  fn- )  around  the 
point  fp  and  truncate  after  the  terms  of  order  k. 


5.5  Time  Discretization 

The  problem  at  hand  is  to  find  a  good  approximation  for 
the  time  dependent  term 


1 

Up 


dt 


•</fi  =  <  qt  >' 


(127) 


in  eq.  (100)  and  eq.  (113)  where  q  is  the  exact  solution 
of  the  respective  analytical  problems.  The  discretization 
is  [airly  different  for  ZM-HOR  and  GC-HOft-Algorithins 
algorithms  described  in  section  4.  Therefore,  the  time 
discretization  for  schemes  based  on  the  ZM-HOR  and 
the  GC-H OR- Algorithm  will  be  discussed  separately  for 
the  linear  problem  with  source  term  given  by  eq.  (97). 
Other  time  discretizations  reported  in  the  literature  will 
be  addressed  as  well. 


5.5.1  ZM-HOR-Algorithtu 

Consider  the  following  discretization  D(qt)  of  the  term 
given  in  cq.  (127): 

[di*mq)  =  q*  (128) 

where  Q  is  a  discrete  solution,  see  also  eq.  (101).  The 
mean  values  in  the  right  hand  side  of  eq.  (128)  could  be 
computed  as  the  mean  of  the  reconstruction  polynomial 
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over  cell  P,  However,  from  eq.  (S4)  anil  the  fact  that 
zero-mean  polynomials  arc  used,  it  follows  that  thejccll 
average  of  the  kth  order  reconstruction  of  Q  equals  Qp. 
Hence,  no  cell  averaging  phase  is  needed  in  the  updating 
phase  of  the  computation.  Proin  cq.  (101)  one  finds  then 
that  the  new  cell  value  of  the  discrete  solution  at  the  next 
time  step  can  immediately  be  evaluated  as: 

< 7?'  [/)(«. +  !>(£)]  (Qn)  (129) 

Let  us  now  investigate  the  truncation  error  of  the  dis¬ 
cretization  given  in  cq.  (128)  by  inserting  the  exact  so¬ 
lution  q  and  performing  a  Taylor  expansion  in  time  and 
space  around  q'pB  <  q(r,  <")  >p .  One  gets: 


[D  (?,)](„) 


<q(f,t"^)>P  ~<q(?,tn)  >P 

At 

<  q,  >P  +0(M)  (130) 


5.5.2  GC-HOIl-Algorithm 

The  time  discretization  in  eq.  (101)  or  cq.  (128)  is  based 
on  mean  values  of  the  solution  over  the  cell  P,  also  for  the 
scheme  using  the  GC-HOH-algorithm.  However,  unlike 
the  scheme  using  the  ZM-HOR-algorithm,  the  mean  of 
the  solution  over  the  cell  is  not  stored  but  the  value  of  the 
solution  at  some  reference  point  (in  general  the  gravity 
center).  Hence,  a  averaging  phase  will  be  needed  in  the 
updating  phase  of  the  computation: 


Qr+l 


-Q"p 


<  Qp*'  -rW.Q"+,.G  >p 
-  <  QnF+  W.Qn.G  >p 
AQr  +  W.AQ.IPP  (131) 


where  W,  G  and  Q  are  defined  in  eq.  (58),  (78)  resp.  (80), 
vector  !/>  />  is  given  by: 


XP.P  f.P.P  ,  P.P 
1  ~  ['i.ff  v> 


(132) 


and  A Qr  and  AQ  stand  for  the  time  changes  of  the  solu¬ 
tion  values  in  centroids  of  cell  P  and  .csp.  ail  colls  belong¬ 
ing  to  the  support  of  cell  This  means  that  eq.  (131) 
leads  to  a  system  of  equations  since  it  has  to  be  satisfied 
for  all  cells  simultaneously.  This  system  could  be  solved 
with  any  direct  or  iterative  inversion  algorithm.  Since 
this  is  an  expensive  operation,  mass  lumping  can  be  ap¬ 
plied  just  as  is  done  in  Finite  Element  techniques.  This 
means  that  instead  of  averaging  the  complete  reconstruc¬ 
tion,  only  the  constant  part  Qp  is  taken  into  account. 
However,  since 


<  Qp*  >'  =  Qr  +  0(h).VQ 

the  averaging  phase  will  introduce  an  additional  first  or¬ 
der  space  error.  If  one  is  only  interested  in  steady  state 
solutions  whereby  Z>J  tends  to  zero,  this  temporarily  in¬ 
creased  truncation  error  is  acceptable.  The  new  discrete 
values  at  a  next  time  level  are  then  given  by: 


Qr"  =  Qnr  +  A<  [/>(S)  -  Z>(mf,)]  (Qn) ,  VP  (133) 


Q(i)  =  c/°>  +  At  'f^dkC  [Q(k)]  (131) 

fcsO 

with 

C  M  =  [0  (ff.fy)  +  z>(S)]  (Q(fc)) 

whereby  »  =  1,2 . p  and  Q ^  =  Qn  and  =  Q,,+1. 

The  integer  p  denotes  the  total  number  of  intermediate 
steps. 

Shu  and  Osher  [3l]  have  chosen  the  coefficients  c,*  such 
that  the  scheme  is  TVD  under  certain  C’FL-restrictions 
(a  detailed  discussion  of  stability  and  Total  Variation  is 
deferred  to  section  6).  They  came  up  with  second,  third, 
fourth  and  fifth  order  schemes  of  which  only  the  second 
order  scheme  is  presented  here: 


Q<» 

=  Q<0)  +  AtC  [<2(0)] 

<J<2) 

=  Q<0)  +  ^At  [d  [Q(0>]  +  C 

[Q0)]  }(135) 

CFL 

<  1 

(136) 

(137) 

Note  that 

the  <7F£-restriction  is  valid  only  in  ID  com- 

putations. 

For  two  calculations  the  limit 

is  1/2.  Note 

that  ENO-rcconstruction  is  needed  after  each  intermedi¬ 
ate  time  step,  see  section  6. 

Harten  [16]  proposed  a  single  step  higher  order  accu¬ 
rate  time  stepping  scheme  avoiding  multiple  invocations 
of  ENO- reconstruction  algorithms.  Harten’s  approach  is 
based  on  a  Cauchy-Kcwalewski  procedure  whereby  the 
solution  <?(?,<)  is  writt-  .  in  a  Taylor  series  expansion  in 
both  space  and  time  whereby: 


Qs  1  =  Sx  -  a.Qxx  -  b.Qxv 

Qyl  =  Sj,  -  tt.Qxy  —  h.Qyy 

Qtt  —  St  —  a.Qu  —  b.Qvt 

=  -a.Sx  —  b.Sy  +  a1  .Qxx  -p  2ab.Q,v  -f  b2  ,Qvy 

-  -((«-«) s| +  [H“«] 


These  expressions  arc  of  course  only  valid  if  one  solves 
the  linear  convection  problem  given  by  eq.  (97).  Note 
also  that  one  can  then  show  via  a  recursive  proof  that: 


It  is  clear  that  the  Taylor  series  expansion  of  the  recon¬ 
structed  solution  Q(f,  t)  taken  at  the  initial  time  level 
<n  will  be  equal  to  the  reconstruction  polynomial  at  this 
time  level.  Hence,  all  mixed  time/space  derivatives  can 
be  expressed  in  pure  spatial  derivatives  which  can  be  eval¬ 
uated  using  the  reconstruction  polynomials.  The  scheme 
becomes  then: 


5.5.3  Rungc-Kutta  Methods  Q’p*' 

The  general  explicit  p-stage  Runge-Kutta  method  can  be 
written  as: 


<Tr 

-  £  { [/>(<?.?«)  +  i?(S)]  (Q(r,  r))}  .dr 
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The  continuous  time  integration  is  then  approximated  by 
some  Gauss  quadrature  satisfying  the  desired  time  accu¬ 
racy.  The  values  of  the  solution  at  intermediate  time 
levels  and  at  the  several  Gauss  points  on  the  cell  inter¬ 
faces  arc  evaluated  using  the  Taylor  series  expansion  in 
both  time  and  space.  The  CFL-limit  for  this  scheme  is 
about  1/2. 

5.6  Distinction  with  Finite  Elements 

HOR-Schcmes  differ  from  Finite  Element  methods  (see 
the  book  of  Johnson  C.  [33])  in  that  their  data  repre¬ 
sentation  is  allowed  to  be  discontinuous  whereas  classical 
Finite  Elements  methods,  such  as  the  Thylor-Galerkin  or 
the  Streamwise  Upwind  Petrov-Galerkin  (SUPG)  meth¬ 
ods  [34,  10],  use  a  continuous  data  interpolation  1  aged 
on  Galerkin  basis  functions.  The  discontinuous  interpo¬ 
lation  used  in  the  HOR-Schemes  allows  a  straightforward 
incorporation  of  one  dimensional  Riemann  solvers  having 
upwind  properties.  Note,  however,  that  recently  Finite 
Element  discretizations  have  been  devised  using  discon¬ 
tinuous  solution  representations. 

Finally,  the  spatial  size  of  the  support  increases  with  the 
order  of  accuracy  in  HOR-  Schemes.  The  order  of  a  Finite 
Element  scheme  is  increased  by  adding  new  degrees  of 
freedom  in  the  same  element  hence  avoiding  a  spatial 
growth  of  the  support. 

6  ADAPTIVE  STENCILS 
6.1  Introduction 

A  general  explicit  scheme  can  be  cast  as,  sec  also 
cqs.  (101,(102)  and  (114), (115) 

<&+*  =*>(<&•)  (138) 

with  D  some  discretization  operator  acting  upon  the 
solution  given  in  each  cell  R *  of  the  support,  The 
scheme  (138)  is  called  Total  Variation  Diminishing 
(TVD)  if: 

TV  (Qn+ ') -TV  [D  (Q" )]  <  TV  (Qn)  (1 39) 

with 

7V(<n=£El<?K -<?'/•!  (140) 

p  k>H 

in  which  R  points  to  a  direct  neighbour  of  cell  P.  Mar¬ 
ten  [35,  36]  showed  the  following  properties  of  linear 
schemes  for  scalar  problems;  i.e.  the  operator  D  is  linear: 

1.  A  monotone  scheme  (1)  is  a  monotone  function  of 
its  arguments)  is  always  TVD,  but  not  vice  versa. 

2.  All  coefficients  in  a  monotone  scheme  are  positive. 

3.  A  TVD-schemc  is  always  monolonicity  preserving, 
but  not  vice  versa.  Monotonicity  preservation  im¬ 
plies  that  if  Q  is  monotone  on  a  given  mesh,  then 
I)(Q)  will  also  be  monotone  on  the  same  mesh. 

4.  A  monotonicity  preserving  scheme  is  always  at  most 
first  order  accurate  in  space.  From  this  it  follows 
that  monotone  and  TVD-schemes  are  also  always 
first  order  accurate. 

For  nonlinear  schemes,  however,  the  following  properties 
arc  valid: 

1.  A  monotone  scheme  is  always  at  most  first  order 
accurate  in  space. 


2.  AU  coefficients  in  a  monotone  scheme  arc  positive. 

3.  A  TVD-scheme  is  always  monotonicity  preserving, 
but  not  vice  versa. 

4.  A  monotonicity  preserving  scheme  can  be  higher  or¬ 
der  accurate  in  space,  hence  also  a  TVD-schemc, 

5.  If  one  has  a  uniform  boundedness  of  the  TV  then 
convergence  to  a  weak  solution  is  achieved. 

6.  If  the  TV  of  the  discrete  solution  at  time  level  n  +  1 
docs  not  exceed  the  TV  of  the  reconstruction  of 
the  solution  at  time  level  n  or,  in  other  words,  if 
TV[D(Qn)]  <  TV  (Qr"°'n),  then  the  scheme  is 
TVD. 

7.  TVD-schemes  are  always  stable  but  are  at  most  sec¬ 
ond  accurate  in  Bpaee. 

Considering  the  schemes  given  by  eq.  (101)  and  (114) 
with  the  above  properties  in  mind,  one  finds  that  they 
are  not  monotore  nor  TVD.  Hence,  convergence  is  not 
at  ali  guaranteed.  In  order  to  stabilize  these  higher  order 
schemes,  their  discretization  operator  D  must  be  made 
nonlinear. 

As  higher  order  accuracy  is  required,  the  resulting  non¬ 
linear  schemes  cannot  be  TVD  since  the  latter  can  not 
achieve  more  than  second  order  accuracy.  One  tries 
therefore  to  make  an  "almost  TVD”  scheme  based  on  Es¬ 
sentially  Non- Oscillatory  (END)  solution  reconstruction; 
i.e.  a  reconstruction  satisfying: 

TV  (Qrtro)  <  TV  (Q)  +  O  (Afc+1)  (141) 

As  the  support  of  the  reconstruction  is  arbitrary,  the  con¬ 
dition  (141)  can  be  fulfilled  by  selecting  the  support  in 
an  adaptive  way  such  that  no  solution  discontinuities  of 
order  less  than  or  equal  to  fc  are  traversing  the  zone  of  the 
support.  The  nonlinearity  of  the  discretization  operator 
lies  then  in  the  fact  that  cells  arc  accepted  or  rejected 
from  the  support  after  each  iteration.  Such  a  scheme  is 
also  called  a  moving  stencil  scheme  (MS).  As  the  order 
of  accuracy  is  independent  on  the  support  choice  (sec 
section  5),  the  accuracy  of  M  5-schemes  is  not  affected 
by  moving  the  stencils.  Hence,  Af  A'-schemes  cannot  be 
TVD.  However,  no  proof  is  available  yet  that  would  guar¬ 
antee  the  uniform  boundedness  of  the  TV  and  hence  con¬ 
vergence.  Nevertheless,  the  numerous  computations  re¬ 
ported  on  in  the  literature  indicate  the  stable  properties 
of  M 5-schemes. 

In  this  section,  two  adaptive  support  selection  algorithms 
will  be  discussed.  The  first  algorithm  performs  a  sup¬ 
port  gathering  by  heuristic  marching  (SSM- Algorithm, 
Support  Selection  by  Marching).  The  second  algorithm 
selects  the  support  out  of  a  set  of  candidate  supports 
such  that  the  global  norm  of  the  derivatives  of  the  re¬ 
construction  polynomial  are  minimal  at  the  gravity  cen¬ 
ter  (SSG-Algorithm,  Support  Selection  for  Global  norm 
minimization). 

6,2  SSM-Algorithm 

6.2.1  General  Concept 

The  support  of  a  cell  P  always  contains  the  cell  P  itself 
whereafter  the  two  direct  neighbours  making  the  smallest 
gradient  are  added.  The  gradient  associated  with  two 
direct  neighbours  is  estimated  using  a  local  triangulation 
connecting  the  gravity  centers  of  the  direct  neighbours 
with  the  centroid  of  cell  P,  see  fig.  31. 
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Fig.  31 :  Local  triangulation  foi  gradient  estimation. 


One  assigns  the  given  cell  averages  (ZM-llOR- Algorithm) 
or  point  values  (GC-HOll- Algorithm)  of  the  solution 
function  q  to  the  vertices  of  each  local  triangle.  The  gra¬ 
dient  in  such  triangle  is  then  obtained  through  a  linear 
interpolation  over  the  triangle: 

a  I  1  f  Aqr\  Ajqq  -  1 

,lAi’i2  ~  2ft api2  "  L  A?pi  Axpj  c  AqpjAxpi  J 

(142) 

where 

Aqpi  —  q\  -  qp  and  A qn  =  4a  —  Qp 
and  similar  definitions  for  the  coordinates  x  and  j/. 

After  inserting  eel!  1  and  2  in  the  support,  a  marching 
procedure  u  stal  led  from  cell  1.  Another  pair  of  direct 
neighbours  of  cell  1  is  added  if  the  gradient  differs  least 
from  the  gradient  that  was  computed  when  cell  1  was 
added  to  the  support.  In  other  words,  the  support  is 
selected  in  the  /one  where  the  solution  function  q  remains 
as  dose  as  possible  to  a  31)  plane  passing  through  the 
point  rp(xp,yp,qp)  and  having  $qi,p  as  a  gradient. 

The  selection  algorithm  includes  central  biasing.  This 
means  that  a  central  support  will  be  selected  if  the  gra¬ 
dients  estimated  with  each  pair  of  direct  neighbours  differ 
very  little.  In  that  case  all  direct  neighbours  arc  included. 
Central  biasing  is  interesting  since  the  round-ofT  error  for 
the  reconstruction  is  then  minimal. 

Finally,  note  also  that  the  algorithm  breaks  down  on  reg¬ 
ular  meshes  if  the  order  of  the  reconstruction  is  larger 
than  one.  Problems  as  described  in  section  4.5  occur  in 
this  case. 

0.2.2  Detailed  Procedure 

Note  that  the  selection  procedure  is  implemented  in  an 
Object-Oriented  manner  using  C++.  Therefore  the  fol¬ 
lowing  discussion  will  be  more  data  and  event  related  in¬ 
stead  of  tracing  the  more  traditional  paths  of  flow  charts 
and  functionalities. 

6.2.3  Startup 

The  minimum  number  of  cells  n  needed  in  the  support  is 
set  equal  to: 


«  =  5(*  +  l)(*  +  2)  +  2 

Two  cells  ate  added  in  order  to  have  better  condi¬ 
tioned  matrices,  in  particular  for  regular  meshes  (see  sec¬ 
tion  4.5).  Besides  this,  a  discontinuity  sensor  4  is  initial¬ 
ized  to  5  (see  subsection  $.2.5). 

The  support  used  during  the  previous  iteration  is  emp¬ 
tied  and  the  current  cell  P  is  added  as  a  first  cell  in  the 
support  for  the  iteration  at  hand. 


6.2.4  Search  for  Lowest  Gradient 

Using  the  local  triangulation  with  the  neighbours  of  P 
as  described  in  subsection  $.2.1,  one  stores  the  corre¬ 
sponding  norms  of  the  gradients  in  increasing  order.  The 
value  following  which  the  neighbours  arc  sorted  is  called 
the  sorting  variable  which  now  is  the  square  of  the  norm 
of  the  gradients  of  the  variable  q  one  is  reconstructing. 
The  resonated  first  and  second  neighbour  of  each  triangle 
arc  listed  in  the  same  order  as  well  as  the  gradient  vec¬ 
tors.  The  object  containing  all  this  information  is  called 
a  neighbour  sorter,  named  NSp\ 
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where  N\  and  N2  are  the  lists  of  the  first  resp.  second 
neighbour  forming  the  local  triangle. 

For  each  neighbour  11  of  P  another  neighbour  sorter  NSr 
is  generated  whereby  the  sorting  variable  is  given  by  the 
following  expression: 


\\Vqi,R  - 


(143) 


in  which  the  first  subscript  t  is  associated  with  the  local 
triangle  Ah,,, >41  as  depicted  in  fig.  32,  while  the  second 
subscript  P  or  R  denotes  the  cell  around  which  the  local 
triangulation  is  made. 


rig.  32:  Triangle  Ah,.,»+i. 

The  lowest  sorting  variable  value  of  each  generated  neigh¬ 
bour  sorter  is  multiplied  with  the  square  of  the  norm  of 
the  gradient  in  the  associated  triangle  Ak.i.ii  1 : 


|^4>.n  “  ^IH.P 


l/t,W,ti»n 


(144) 


This  value  is  now  used  as  the  sorting  variable  for  a  new 
neighbour  sorter  NS pr  acting  upon  the  direct  neigh¬ 
bours  R  of  the  cell  P.  In  other  words,  a  neighbour  R 
comes  first  in  the  neighbour  sorter  if  its  smallest  relative 
gradient  difference  is  small  as  well  as  the  gradient  with 
which  the  smallest  relative  gradient  was  achieved. 


6.2.5  Discontinuity  Detection 

The  procedure  described  in  the  previous  subsection  re¬ 
sults  in  a  list  of  neighbours  ordered  with  increasing  asso¬ 
ciated  gradient  norms  and  norms  of  gradient  differences. 
This  is  still  not  yet  sufficient  to  come  to  the  final  decision 
of  accepting  a  neighbour  into  the  support.  Indeed,  the 
norm  of  the  gradient  differences  might  he  smaller  across 


ft 
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a  discontinuity.  Hence,  a  separate  discontinuity  detection 
is  required. 

Given  the  gradient  vectors  stored  in  the  neighbour  sorter 
NSr,  the  discontinuity  sensor  <f>  is  defined  as: 


£/)=i 


N.  ^ 


'  GK.F.min 


(145) 


where  |  vqn.r.mtn  ||  is  the  minimum  norm  of  the  gradient 
vectors  given  in  the  sorter  NSr  while  N  is  the  number 
of  direct  neighbours  of  the  cell  P.  The  sensor  <t>  is  a 
measure  of  the  gradient  variation  around  the  cell  P.  A 
cell  R  belonging  to  the  list  N i  of  the  sorter  N  Sp  will  be 
added  to  a  list  of  rejected  celts  if: 


<t>  >  'I'll  RES  (-  0.1  initially )  (146) 

and 

min  (||^9N.>>||  ,  |^4n-i.i>| 

>  ||^</n,7\titin  |  •  ^  +  5^  (147) 

where  the  subscript  R  points  now  to  the  R,h  cell  in  the 
cyclic  list  giving  the  direct  neighbours  of  the  cell  P  in 
counter-clockwise  direction,  see  also  fig.  33. 


Fig-  33:  Rejecting  cells  at  a  discontinuity. 

The  coefficient  0  in  the  dcnumcrator  in  the  right  hand 
side  of  e.q.  (147)  was  determined  empirically  to  comply 
with  the  widest  possible  range  of  gradient  values. 

0.2.0  Acceptance  of  Support  Cells 

Denoting  the  sorting  variable  value  of  neighbour  R  in  the 
neighbour  sorter  NSrn  by  SVr,  a  cell  iu  the  list  N j  and 
the  associated  cell  in  the  list  Nt  forming  the  local  triangle 
will  be  admitted  to  the  support  if: 

•  the  number  of  cells  i»  already  present  iu  the  support 
satisfies: 

»<  +  >)(*  +2)  +  2  (148) 

SVr  -  SVn.„.m 

SVR,m„  -  SVH,mm  1  ' 

where  0  <  e  <  1 

•  the  candidate  cell  is  not  in  the  list  of  rejected  cells 
and  not  already  present  in  the  support. 

The  second  condition  leads  to  a  central  stencil  if  the  gra¬ 
dient  is  almost  constant,  i.c.  for  almost  linearly  varying 
solution  functions.  The  parameter  <  is  called  the  central 
bias  parameter:  the  higher  its  value  is  set,  the  more  cen¬ 
tral  supports  will  be  obtained.  Cells  out  of  the  list  Aj 


arc  added  simultancr  usly  to  the  support  in  order  to  avoid 
having  a  one  dimensional  support  leading  to  the  problem 

discussed  in  section  •!.!>. 

When  a  cell  of  the  list  ,Vj  is  accepted  i  bra  aching  flag  is 
set  at  the  next  cell  in  the  list  N\.  This  ip  relevant  to  the 
marching  procedure  dcscrilred  in  subsection  0."  7. 

6.2.7  Marching 

Looping  now  over  the  new  cells  added  to  the  support, 
the  whole  procedure  starting  from  subsection  6.2.4  is  re¬ 
peated.  Again,  one  loops  over  the  latest  added  cells  and 
so  on. 

If  one  is  in  a  region  enclosed  by  discontinuities  or  do¬ 
main  boundaries,  it  could  be  that  no  more  cells  can  be 
admitted.  In  this  case,  one  tries  to  make  a  branch  in  the 
marching  at  cells  where  a  branching  flag  was  set.  If  this 
docs  not  lead  to  additional  support  cells,  one  loops  down 
in  the  support  and  tries  again  to  add  other  neighbours.  If 
still  this  remains  unsatisfactory,  the  threshold  for  the  dis¬ 
continuity  sensor  <f>  is  doubled  and  the  procedure  starts 
all  over  again.  This  can  be  repeated  until  the  threshold 
becomes  so  high  that  the  weakest  discontinuities  arc  just 
seen  as  sharp  gradients.  The  reconstruction  will  then  no 
longer  be  truly  KNO  and  the  only  cure  is  the  have  more 
and  finer  cells  in  these  /.ones. 

6.2.8  Boundaries 

Arriving  in  a  boundary  cell,  fig.  34,  one  sorts  first  the 
neighbours  of  the  interior  neighbour  if  based  on  the 
norms  of  the  absolute  gradient  vectors,  see  section  6.2.4. 


Fig.  34:  Gradient  estimation  at  boundaries. 

The  cells  A  and  71  are  direct  neighbours  of  the  interior 
neighbour  R  which  arc  vertices  of  the  two  local  triangles 
that  have  also  the  boundary  cell  P  as  a  vertex.  Their 
neighbours  are  sorted  using  the  following  sorting  variable: 

SVa  =  ||^.,a  -fya.i-lf 

SVn  = 

The  neighbours  of  the  interior  neighbour  arc  once  again 
sorted  but  now  using  the  relative  norm  of  the  difference  of 
the  respective  gradient  vectors  with  the  mean  of  the  gra¬ 
dient  vectors.  The  result  is  stored  in  a  neighbour  sorter 
NSr.  The  lists  N i  and  Ni  of  the  sorter  NSn  contain  each 
both  the  cells  A  and  It.  If  SVa  <  SVy,  one  selects  cell 
A  from  list  N\  together  with  the  corresponding  sorting 
variable  value  and  gradient  vector  in  NSr.  'These  three 
items  are  then  stored  in  the  neighbour  sorter  NSpn  of  t  he 
boundary  cell  P.  If  5V/t  >  SVy ,  the  cell  D  is  selected 
from  the  list  N2  of  NS r  as  well  as  the  corresponding 
sorting  variable  value  and  gradient  vector.  Again,  these 
items  arc  stored  in  NSm. 

The  neighbour  sot  ter  NSrR  finally  contains  a  degenerate 
triangle  &r,p,r  while  the  gradient  vector  is  the  gradient 
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in  the  local  triangle  &n,A,i>  or  &:u\n  having  the  small¬ 
est  relative  difference  in  gradient  vectors.  The  sorting 
variable  value  is  the  square  of  the  relative  norm  of  the 
difference  between  the  gradient  vector  in  the  selected  tri¬ 
angle  and  the  mean  gradient  vector  of  the  interior  neigh¬ 
bour  R. 

The  reason  behind  this  rather  complex  procedure  is  the 
fact  that  at  weak  discontinuities  cither  cell  A  or  If  lies  on 
the  other  side  will  still  having  the  lowest  absolute  gradi¬ 
ent  vector.  This  is  anticipated  by  looking  to  the  smallest 
relative  gradient  vector  differences  of  both  cell  A  and  If. 
The  result  might  be  that  one  has  to  select  the  neighbour 
with  the  higher  gradient  but  with  the  smallest  gradient 
variation. 


0.3  SSGAlgoritlun 

The  SSG-Algorithin  is  a  generalization  towards  meshes 
with  arbitrary  polygonal  cells  of  a  similar  algorithm  pro¬ 
posed  by  Marten  [16]  for  triangular  grids.  The  algorithm 
suggests  itself  a  set  of  candidate  supports  around  a  cell 
P  and  selects  then  the  support  for  which  the  global  norm 
of  the  derivatives  of  the  reconstruction  polynomial  at  the 
gravity  center  of  tlto  cell  P  is  minimal.  The  global  norm 
||f?|li.  is  given  by: 


whereby 


d'+V,c# 


Ox'd  y> 


=  A,., 


(151) 


for  both  the  GC-Algorithni  and  the  ZM-Algorithm.  The 
coefficient  A,,t  is  an  element  of  the  vector  A.  given  by  for 
instance  eq.  (37).  This  implies  that  the  weight  matrix  W 
is  to  be  recalculated  for  each  candidate  support. 


6.3.1  Candidate  Supports 

The  first  candidate  is  always  the  central  support,  i.e.  the 
support  obtained  by  taking  the  neighbours  of  the  neigh¬ 
bours  and  so  on  without  takieg  any  other  limitation  into 
account.  After  this,  /V  sector  supports  arc  gathered  with 
A'  the  number  of  sides  of  a  cell  P.  The  gravity  center 
of  the  cells  of  a  sector  support  lies  entirely  in  a  sector 
limited  by  the  two  lines  connecting  the  gravity  center  of 
cell  P  with  two  consecutive  vertices,  see  lig.  35. 


Fig.  35:  Division  by  Sectors. 

Note  that  a  point  P  lies  within  a  sector  defined  by  two 
half  lines  L\  and  Lj  defined  by  their  direction  vector  at 
resp,  <?j  if,  see  fig.  36: 


Fig.  36:  Condition  to  ly  within  a  sector. 


The  conditions  to  be  satisfied  in  order  to  accept  a  cell  in 
a  sector  support  are  the  same  as  for  the  SSM- Algorithm 
except  for  condition  (119). 

l  or  each  of  the  N  +  I  candidate  supports  the  global  norm 
||Z?||j,  is  computed  and  the  candidate  with  the  small¬ 
est  global  norm  is  selected.  Note  also  that  the  norm  ol 
the  central  candidate  support  is  divided  by  two  in  order 
to  establish  a  bias  towards  central  supports  in  order  to 
improve  the  precision,  see  also  Kartell  and  Chakravar- 
thy  [16]  or  Shu  [37], 


Fig.  37:  Too  small  sector  supports  in  corners. 


6.3.2  Exceptions 

l  It  might  happen  in  corners  formed  by  two  discon¬ 
tinuities  or  by  two  domain  boundaries  that  not 
enough  cells  can  be  accepted,  see  lig.  37. 

r)io  opening  -nMglc  of  (he  senior  is  then  £r^dv.;»!!y 
increased  until  enough  cells  can  be  accepted .  Can¬ 
didate  supports  will  then  of  course  be  partially  over¬ 
lapping  If  still  not  enough  cells  can  be  found,  the 
opening  angle  of  the  sector  is  decreased  back  to  its 
initial  value  but  the  value  of  'J'll RPR  in  the  shock 
detection  condition  (146)  is  increased  by  a  factor  5 
until  enough  cells  were  gathered. 

If  no  cells  were  ever  rejected  due  to  condition  (146) 
and  if  a  large  number  of  sector  widenings  took  place, 
the  candidate  is  no  longer  considered  when  still  not 
enough  support  cells,  were  found.  In  this  case,  one 
was  dealing  with  a  corner  of  the  domain  where  the 
solution  variables  were  varying  smoothly. 

2.  When  a  neighbour  of  cell  P  is  a  boundary  cell,  the 
associated  sector  support  is  always  too  small  and 
hence  is  rot  considered  as  a  candidate  support,  see 
fig.  38. 


6.3.3  Boundaries 


«i  ®  TAP  >  9  ami  03  ®  tap  <  0  (152) 

where  the  vector  far*  connects  the  position  of  the  apex  A 
of  the  sector  with  the  position  of  the  point  P. 


The  candidate  supports  for  the  reconstruction  in  bound¬ 
ary  cell  P  consist  of  the  central  support  of  the  interior 
neighbour  and  of  the  sector  supports  associate*!  with  the 
interior  neighbour.  However,  the  latter  do  not  contain 
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Fig.  38:  Sector  formed  with  *  boundary  cell, 


the  interior  neighbour  itself.  This  is  done  in  order  to 
enhance  the  chances  for  KNO-rcconstniction  when  a  dis¬ 
continuity  is  present  in  between  the  gravity  centers  of  the 
boundary  cell  and  its  interior  neighbour. 

Note  that  except  for  the  central  supports,  no  boundary 
cells  arc  ever  allowed  in  the  support  since  having  a  bound¬ 
ary  cell  in  the  support  close  to  a  discontinuity  may  lead 
to  instabilities.  Indeed,  a  boundary  cell  which  was  ill  a 
smooth  region  may  suddenly  be  in  a  non-smooth  /one 
after  applying  the  boundary  conditions  which  lead  to 
a  change  in  the  solution  value  stored  ir.  the  boundary 
cell.  When  calculating  afterwards  the  flux  balances  in 
the  interior  domain,  the  reconstruction  in  cells  close  to 
the  boundary  will  no  longer  be  ENO;  hence,  large  os¬ 
cillations  will  occur.  Thi3  could  eventually  be  cured  by 
selecting  new  supports  after  updoing  the  boundary  cells 
but  this  would  be  far  loo  costly. 


0.4  Examples 

In  the  following,  some  reconstructions  of  known  polyno¬ 
mials  with  a  discontinuity  are  presented  here.  Only  ex¬ 
amples  using  the  CIC-HOR-algorithm  combined  with  thr 
SSM-Algorithm  are  givc.i  as  similar  results  were  obtained 
with  the  ZM-HOR-algorithm. 

The  next  set  of  functions  were  used  to  demonstrate  the 
monotonicity  of  the  reconstruction: 


Pi(r.y) 

=  li  + 

i’i(r.y) 

-  3r3  + 

w 

+ 

2/y 

+  2* 

+  V 

P}(*,y) 

=  3*J  + 

V 

•f 

(*  +  .v)2 

i\{r,y) 

=  t  ',v  •- 

2.U4 

- 

•t*  y 

r  ■ 

■i-  V 

if  y  >  tan(30").(j-  —  1)  and 


(153) 


reconstructed  exactly  (/\  was  also  the  function  to  detect 
discontinuities). 

Only  the  shape  of  the  discontinuity  is  not  exact.  One 
sees  that  it  follows  exactly  the  grid  lines  instead  of  being 
a  straight  line.  There  arc  methods  to  cure  this  problem 
like  for  instance  the  subccll  resolution  technique  proposed 
by  llartcn  ct  al.  (ref.  [38]),  see  also  section  7.3 

In  the  following,  the  reconstruction  of  the  exact  solution 
of  the  following  problem  is  discussed: 

q,  +  ?*  +  (y)  *  0  (155) 


with 


q  =  +i 

on 

9  - 

0 

q  -  -1 

on 

y  = 

,1. 

q  =  -2 y  h 1 

on 

X  — 

0 

Equation  155  refers  to  a  convection  problem  whose  con¬ 
vection  speed  is  given  hy  (1 ,  q).  The  solution  is  equal  to 
-1  on  the  lower  half  of  the  domain  and  equal  to  -f  1  on 
the  upper  half.  In  the  triangular  /one  in  the  left  half  the 
solution  is  given  hy: 

«(*.  y)=5~TY  U56) 

and  varies  linearly  along  'ines  where  r  =  etc.  At  the 
borders  of  the  triangular  /one  one  observes  therefore  a 
di.  continuity  in  the  first  derivatives  of  q  and  a  disconti¬ 
nuity  along  the  line  y  —  0.5  if  i  >  0.5.  Fig,  42  shows 
the  isolincs  of  the  quadratic  reconstruction  of  the  exact 
solution  q  using  the  GC-1IOR- Algorithm  combined  with 
the  SSG-Algorithm.  The  discontinuity  along  y  =  0.5  is 
not  visible,  For  reasons  of  comparison,  fig.  43  shows  the 
isolincs  obtained  when  performing  a  linear  interpolation 
between  the  gravity  centers  of  the  grid.  The  discontinu¬ 
ity  takes  now  the  thickness  of  one  cell.  Fig.  44  shows  a 
3D  view  of  the  solution  over  the  two  dimensional  domain 
whereby  each  ”tile’'  in  the  plot  represents  the  reconstruc¬ 
tion  polynomial  in  each  grid  cell.  The  discontinuity  is 
infinitely  sharp  but  again  follows  the  grid  lines. 

Using  the  SSM-Algorithm  in  order  to  obtain  an  ENO- 
rcconstrnction,  the  supports  are  allowed  to  cross  the  dis¬ 
continuity  In  the  first  derivative  at  the  horder  of  the  tri¬ 
angular  zone.  Indeed,  when  marching,  one  takes  fitst  the 
neighbours  forming  the  smallest  first  order  derivative.  In 
II),  this  can  lead  to  the  situation  given  in  fig.  45. 


P\  (i.y)  = 

It  -  \y  -  5 

Pt(i,  y)  - 

-5r3  -t-  <iy3  —  2rt) 

-  r  >i  t  5 

I’At.y)  = 

«(/  -  3)J  -  5y  ’ 

—  (*  T  y)3  +  to 

-8(x  -  3)"'j/  +  4y* 

I  -  2r  -  y  -  30 

otherwise.  Only  some  results  for  functions  Pi  and  l\  will 
be  presented  as  analogue  figures  were  obtained  for  the  lin¬ 
ear  and  quadratic  functions.  Fig.  3t>  shows  a  quadratic 
reconstruct  ion  of  the  cubic  function  whereby  this  func¬ 
tion  was  also  used  to  detect  discontinuities. 

Fig.  40  presents  the  support  for  a  number  of  cells  close 
to  the  discontinuity  and  in  the  smooth  region.  It  is  ob¬ 
served  that  the  supports  tend  to  align  themselves  with 
the  isolincs. 


It  is  apparent  that  first  neighbour  1  of  cell  P  will  be 
taken.  The  inarching  procedure  will  then  continue  on 
the  side  with  the  smallest  gradient  and  as  cell  P  also 
belongs  to  the  support,  the  reconstruction  will  oscillate 
close  to  cell  P.  Marching  support  selection  algorithms 
should  therefore  be  used  with  care. 


Finally,  fig.  41  gives  the  isolines  4,h  order  ENO- 
rcconstruction  of  function  Pi  demonstrating  that  both 
polynomials  on  either  sides  of  the  discontinuity  can  he 


Other  two  dimensional  support  selection  algorithms  are 
given  in  the  literature,  Abgrall  [39]  describes  an  algo¬ 
rithm  for  triangular  meshes  which  is  similar  to  the  SSM- 


Fig.  39:  Isolines  of  function  Pj  with  quadratic  ENO- reconstruction. 


Algorithm.  Harten  [1C]  presents  another  marching  al¬ 
gorithm  which  mathematically  minimizes  the  higher  or 
dor  derivatives.  He  also  proposes  a  fixed  central  support 
scheme  for  problems  without  discontinuities.  This  choice 
is  justified  by  the  fact  that  central  supports  are  most 
accurate  when  reconstructing.  An  automatic  switch  be¬ 
tween  constant  solution  reconstruction  and  higher  order 
reconstruction  is  introduced  when  solving  problems  with 
weak  shocks  (hybrid  reconstruction).  This  will  of  course 
not  load  to  uniformly  higher  order  schemes. 

Shu  and  Oshcr  [14]  report  on  the  effect  of  the  choice  of 
lire  first  neighbouring  support  cell  on  the  stability.  They 
developed  a  support  selection  algorithm  which  takes  the 
first  neighbour  support  cell  always  on  the  upwind  side. 
Finally,  it  is  sometimes  suggested  to  keep  the  supports 
fixed  in  smooth  zones  in  order  to  reduce  the  computa¬ 
tional  cost.  However,  it  still  remains  to  verify  that  such 
schemes  are  stable  since  tne  solution  can  start  to  oscillate 
also  in  smooth  zones  because  the  scheme  is  not  TV1). 
This  means  that  the  supports  will  have  to  be  adapted 
which  increases  again  the  computational  cost. 


I'ig.  44:  Nonlinear  scalar  convection  problem,  3D  view  of  tl>e  quadratic 
construction  of  the  exact  solution  (GC-1IOR  -f  SSG). 


re- 
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7  SYSTEMS  OF  EQUATIONS 


7.1  Introduction 


Until  now  only  scalar  problems  were  tackled  but  the  final 
aim  is  to  solve  a  system  of  equations  such  as  the  Ruler 
equations.  Although  the  extension  towards  systems  of 
equations  is  rather  straightforward,  some  additional  top¬ 
ics  are  particularly  interesting  when  solving  the  Euler 
equations  (2), (3), (4).  This  section  will  treat  the  recon¬ 
struction  phase  when  solving  systems  of  equations  and 
also  some  recently  developed  concepts  for  sharpening  dis¬ 
continuities. 


7.2  Reconstruction 


(n  the  case  of  the  two  dimensional  Euler  equations,  one 
is  .tow  dealing  with  a  solution  vector  Q  of  4  solution 
variables.  Two  approaches  arc  now  possible  [16],  the 
component-wise  vuppc.t  selection  and  the  vector  support 
selection . 


The  component-wise  support  selection  assigns  a  support 
to  each  component  of  the  solution  vector.  It  increases 
the  work  spent  in  the  support  selection  algorithm-  by  a 
factor  4  but  seems  to  be  more  robust  when  several  mov¬ 
ing  shocks  intersect  in  time-dependent  problems.  Note 
that  in  this  case  there  is  not  always  enough  room  to  find 
a  support  in  which  no  discontinuities  occur  and  hence 
spurious  oscillations  can  be  generated. 

The  vector  support  selection  assigns  one  and  the  same 
support  to  all  4  solution  variables.  The  selection  is 
performed  on  basis  of  the  smoothness  of  a  single  (de¬ 
rived)  variable  such  as  the  Mach  number,  static  pressure, 
density,. . .  This  approach  is  considerably  more  economic. 

Concerning  the  robustness  of  a  scheme  to  be  used  for 
the  Euler  equations  it  is  important  to  remind  that  the 
density  p,  static  pressure  p  and  static  temperature  T  are 
uonnegative  quantities.  Reconstruction  of  the  conserva¬ 
tive  variables  in  an  ENO-fashion  gives  a  direct  control 
over  the  density  but  not  over  the  derived  quantities  p 
and  T.  Small  oscillations  in  the  reconstruction  of  trie 
conservative  variable  close  to  multiple  discontinuity  in¬ 
teractions  lead  in  general  to  larger  oscillations  in  p  and  T. 
When  computing  these  quantities  at  a  cell  interface  using 
the  local  values  of  the  reconstruction  of  the  conservative 
variables,  one  can  get  a  negative  value  as  a  consequence 
of  which  the  solution  procedure  is  to  be  stopped. 

As  the  characteristic  variables  vary  more  smoothly  than 
the  conservative  variables  at  discontinuity  interactions, 
it  is  often  suggested  in  the  literature  to  perform  a  local 
transformation  to  tne  characteristic  variables.  One  can 
then  compute  all  derived  variables  from  ihc  component- 
recor.strucf'or  of  the  characteristic  variables  which 
remain  smooth  at  discontinuity  interactions.  The  trans¬ 
formation  matrix  contains  the  eigenvectors  of  the  Jaco¬ 
bian  ni  itrix  of  the  flux  normal  to  the  discontinuity. 
Another  approach  is  to  reconstruct 
the  rector  (/>,  pu,pv,p)T.  This  gives  again  direct  con¬ 
trol  ov.-r  the  density  and  the  pressure  and  moreover,  the 
normal  momentum  and  p  remain  smooth  across  contact 
discontinuities. 

7.3  Subcell  Resolution 

In  ll),  Kartell  [38]  proposed  a  method  to  resolve  discon¬ 
tinuities  more  sharply.  A  discontinuity  is  detected  by: 

1,  inspecting  the  norms  IT,,  <r,+i ,  <r,_i  of  the  derivatives 
of  the  reconstruction  ii  cell  i  and  its  neighbouring 
cells  i  -f  1  and  i  -  1,  fig.  46. 

2.  comparing  the  mean  of  the  reconstruction  polyno¬ 
mials  Q[1C|°  and  Q*+C\°  assigned  to  the  neighbours 
i  —  1  resp.  »  +  1  with  the  given  discrete  mean  value 
<  Q  >,  in  cell  P 

A  discontinuity  is  assumed  when: 


<Ti  >  O', -I 

<T,  >  <r,.fj  (157) 

and 

(< >'-<Q >’)  (< Qr,e-° >’  -  < Q >') <o 

(158) 

in  which  <  >’  and  <  Qil\3  >*  denote  the  mean 

of  the  reconstruction  jiolyroiiiial  of  cell  »  +  1  reap.  »  —  1 
over  cell  «.  The  reconstruction  in  cell  •  will  then  consist 
of  the  extension  of  the  reconstruction  polynomials  of  the 


Q 


i 


Fig.  46:  ID  discretization  around  a  discontinuity. 

neighbours  into  the  cell  «,  fig.  47.  The  interface  between 
the  two  polynomials  is  located  at  a  position  x,  such  that: 

*,(0)  =  *,  +  0(z;+i  -  z.)  (159) 

with  9  the  solution  of  the  next  nonlinear  equation  stating 
that  the  mean  in  the  cell  i  must  be  conserved: 

<  Q\l\°  +  <  Q:\\°  >j‘,+  1=<  Q  >,  (160) 

in  which  <  Q"e,°  >'■’  and  <  QJ+i°  >J,+  I  indicate  the 
mean  of  the  reconstruction  polynomial  of  cell  i  -  1  over 
the  interval  [i,y]  resp.  the  mean  of  the  reconstruction 
polynomial  of  cell  t  +  1  over  the  interval  \j,i  +  1]. 
Knowing  6,  a  scheme  can  be  constructed  while  introduc¬ 
ing  a  corrective  flux  [38). 


Fig.  47:  ID  reconstruction  with  subcell  resolution. 

A  2D  extension  might  consist  of  detecting  the  disconti¬ 
nuities  based  on  the  conditions  (146)  and  whereby  the 
normal  direction  of  the  discontinuity  at  a  cell  P  is  indi¬ 
cated  by  the  mean  gradient  vector  over  all  local  triangles 
around  P,  fig.  31  and  fig.  48.  The  polynomials  of  the  two 
opposite  direct  neighbours  having  their  centroid  closest 
to  the  normal  through  the  ceutroid  of  P  will  be  extended 
into  cell  P.  The  interface  line  between  the  two  polyno¬ 
mials  is  parallel  to  the  discontinuity  and  its  location  is 
computed  while  requiring  that  the  mean  over  the  cell  P 
be  conserved. 

7.4  Slope  Modification  Method 

Yang  [40]  proposed  to  add  a  modification  to  the  coeffi¬ 
cients  of  the  linear  terms  in  the  reconstruction  polyno¬ 
mial.  This  modification  is  called  a  slope  modifier  and  is 
a  ir.inmod-like  function  depending  on  the  jumps  in  the 
unmodified  reconstruction  polynomials  at  the  cell  inter¬ 
faces.  The  final  effect  is  that  one  added  artificial  com¬ 
pression  to  the  scheme. 

Yang  produced  impressive  results  for  both  ID  and  2D 
problems.  Moreover,  his  method  is  less  costly  than  the 
subcell  resolution  approach.  For  a  detailed  comparison 
between  the  subcell  and  the  slope  modification  approach, 
the  reader  is  referred  to  the  work  of  Shu  and  Oshcr  [14]. 


Fig.  48:  Subcell  resolution  in  2D. 


8  NUMERICAL  EXPERIMENTS 

8.1  Introduction 

Results  will  be  presented  for  linear  scalar  problems,  non¬ 
linear  scalar  problems  and  an  Euler  flow  problem.  Alt 
results  were  obtained  with  an  object-oriented  code  writ¬ 
ten  in  C+4-  in  order  to  have  an  enhanced  flexibility  when 
testing  and  adding  new  algorithms.  Comparison  of  the 
code  with  an  existing  FORTRAN  code  when  using  ex¬ 
actly  the  same  first  order  scheme  on  the  same  mesh  did 
not  reveal  any  loss  of  efficiency  due  to  the  object-oriented 
implementation. 

8.2  Linear  Scalar  Problems 

In  this  section,  the  verification  of  polynomial  preservation 
will  be  addressed.  Consider  now  the  following  problem 
on  a  rectangular  domain  ft,  see  also  fig.  26c: 

fl.^?  =  S(x,y)  with  a  -(2,1)  (161) 

This  problem  was  studied  with  4  different  source  terms 
S  given  by: 


Si(*,y)  = 

x  +  2y  -  3 

Si(x,y)  - 

x  +  2y  —  3 

S3(x,y)  = 

v'  + 33*2  - 

5«(z,y)  = 

8xs  +  80j/3  +  2-ixy5  -  68 

The  exact  solution  to  the  problem  (161)  is  then  given  by 
following  4  polynomials: 


=  2x  +  3y  +  2 

ft(*.y) 

=  ~x3  +  xy 

"  A*#  +  «**  +  £** 

-  yyJ  -  jzy  + 

P«(*.y) 

=  z*  +  16y4  +  8xy3 
-  34r  +  10 

which  are  the  same  polynomials  as  those  in  eq.  (94)  of 
section  4.  These  solutions  can  of  course  only  be  obtained 
while  imposing  proper  boundary  conditions. 

Tests  were  performed  using  both  the  GC-HOR  and  ZM- 
HOR-Schemes.  In  the  following,  one  will  discuss  how 
polynomial  preservation  was  achieved  for  the  two  algo¬ 
rithms. 
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For  the  GC-HOR-schemc,  tile  exact  solution  q  is  given  in 
the  gravity  centers  of  cacti  cell  : 

Qp  =  l(*oP,  J/G,,) 

while  for  the  ZM-HOR-scheine  it  is  given  as  its  exact 
mean  value  over  each  cell: 

<  Q  >P=  J J  g(x,y).rfll, 

for  both  interior  and  boundary  cells. 

It  was  then  numerically  observed  that  the  imposed  exact 
polynomial  solution  q  of  degree  r  was  not  touched,  i.e. 
the  residuals  are  within  the  round-olf  error  of  the  machine 

(10-**),  if 

•  the  exact  flux  integrals  are  imposed  at  the  bound¬ 
aries  where  the  convection  vector  a  is  pointing  into 
the  domain. 


•  the  order  of  the  reconstruction  is  at  least  equal  to 
the  degree  r  of  the  polynomial  exact  solution 

•  only  the  exact  solution  value  is  imposed  at  the  grav¬ 
ity  centers  of  the  boundary  cells  and  not  the  exact 
flux. 

•  the  source  term  is  discretized  as  follows: 

i>(xa^,yaP)  ■  ftp 

•  only  one  (1!)  Gauss  point  is  used 

•  a  non-conservative  flux  distribution  is  applied.  This 
means  that  each  edge  is  visited  twice  but  each  time 
seen  from  the  other  side.  Thus: 

Qr+1  =  Qp-  Fpr 
Qr  1  =  Ql-Fnp 

whereby 


•  the  same  treatment  as  for  interior  edges  is  applied  at 
boundaries  where  the  convection  vector  points  out 
of  the  domain.  This  is  correct  as  the  value  in  the 
boundary  is  not  used  when  calculating  the  flux.  In¬ 
deed,  the  upwind  approximate  Riemann  solver  will 
only  take  the  value  in  the  interior  neighbour  into 
account. 

•  the  source  term  must  be  averaged  exactly  over  each 
cell,  see  ?Jso  cq.  (120): 

D{S)  =  S(t,  y).dtt 

•  the  order  of  the  reconstruction  it  is  at  least  equal  to 
the  degree  r  of  the  exact  polynomial  solution 

•  the  number  of  Gauss  points  «  satisfies  the  next  con¬ 
dition: 


where  k  is  the  order  of  the  reconstruction. 

Following  theorem  5.2,  this  experiment  can  be  regarded 
as  a  numerical  verification  of  the  accuracy  in  the  interior 
domain  of  the  GC-HOR  and  ZM-HOR  scheme. 

Fig.  49  shows  the  quadratic  reconstruction  of  the  nu¬ 
merical  solution  obtained  using  a  GC-SSG-schcme  with 
quadratic  reconstruction,  a  2  point  Gauss  quadrature 
(GP=2)  along  the  edges  and  the  source  term  S* .  The 
linear  interpolation  of  the  same  solution  is  presented  in 
fig.  50  which  is  to  be  compared  with  the  exact  solution 
shown  in  fig.  25.  Note  that  using  quadratic  reconstruc¬ 
tion  for  a  polynomial  solution  of  degree  4  docs  not  pre¬ 
serve  the  solution  since  the  order  of  reconstruction  does 
not  match  the  degree  of  the  solution.  The  grid  one  used 
is  a  randomized  rectangular  mesh  containing  296  cells, 
fig.  26c.  The  randomizing  of  the  grid  points  is  necessary 
to  avoid  a  contradictory  system  for  the  reconstruction 
weights  as  discussed  in  subsection  4.5. 

The  solution  was  extracted  after  100  iterations  and  took 
90  minutes  of  CPU  on  an  SGI  Iris  4D/35  workstation, 
the  final  residual  is  of  the  order  10_1. 

Remark  that  another  HOR-scheme  can  be  devised 
whereby  the  solution  in  each  cell  is  represented  by  only 
the  linear  part  of  the  higher  degree  (fc)  reconstruction 
polynomial.  Polynomial  preservation  for  such  a  LINEX 
scheme  (LINear  Extrapolation)  is  then  obtained  if: 


Fpn  =  ^  a.iipn.Qp 

n 

Frp  —  ^2  a-HRP-QJi 

p 

in  which  Qp  and  QJt  are  the  extrapolated  values 
at  the  cell  edges  while  using  only  the  linear  terms 
of  the  reconstruction  polynomial.  It  is  clear  that 
the  fluxes  vent  to  the  two  cells  adjacent  to  the  same 
edge  will  not  be  identical  so  that  the  scheme  is  not 
conservative. 

•  at  boundaries  where  the  convection  vector  points 
outward  the  domain,  the  update  of  the  boundary 
cell  P  is  done  as  follows: 

Qr+1  =  Q(n\rp) 

where  the  superscript  (Jfc)  indicates  that  all  terms  ol 
the  reconstruction  polynomial  in  the  interior  neigh¬ 
bour  R  arc  taken  into  account.  For  the  computation 
of  the  flux  contribution  lo  the  interior  neighbour  stiM 
only  the  linear  terms  arc  used. 

The  LINEX  scheme  can  become  economic  for  reconstruc¬ 
tions  of  order  higher  than  or  equal  to  4  since  then  nor¬ 
mally  one  would  need  at  least  3  flux  evaluations  per  edge 
instead  of  2  in  the  case  of  the  LINEX  scheme.  Also 
the  evaluation  of  the  source  term  is  cheaper  as  one  just 
has  to  take  its  value  at  the  gravity  center  (even  if  ZM- 
rcconstruction  is  used).  However,  as  wc  arc  interested 
in  capturing  discontinuities  co.rectly,  the  LINEX  scheme 
L  of  few  importance.  Note  that  other  I'.on-conservative 
schemes  with  reduced  grid  sensitivity  arc  described  in  the 
literature,  see  e.g.  [9], 

8.3  Nonlinear  Scalar  Problems 

In  this  section,  the  numerical  solution  of  the  nonlinear 
problem  of  eq.  (155)  in  section  6  will  be  discussed.  The 
definition  of  the  problem  is  repeated  here: 

It  +  q*  +  =  0  (162) 

with 

q  -  +1  on  y  =  0 

q  =  —  1  on  y  =  1 

q  =  — 2y  +  1  on  z  =  0 
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Fig.  49:  Reconstruction  of  the  solution  of  the  linear  problem,  (  Jt=2,  GP=2, 
GC-SSG). 


Fig.  50:  Linear  interpolation  of  the  solution  of  the  linear  problem,  (  fc=» 2, 
GP=2,  GC-SSG). 


The  exact  solu.ic  is  given  in  fig.  42  and  43.  The  com¬ 
putation  started  worn  the  exact  solution  using  the  GC- 
IIOll  reconstruction  (*=2)  with  SSG  support  selection, 
The  analytic  flux  is  given  by: 

2 

II  =q.nt +  ~.ny  (163) 

The  numerical  fluxes  Hpo  through  the  edges  were  com¬ 
puted  using  a  Roc-type  flux  splitter  whereby  C,„  in 
cq.  (8)  is  now  given  by: 

r>  .  vfqt  Who  ..  rina'r 

Cm  —  r  ^  nvt>n  (lb  v 

A  two  point  Gauss  quadrature  (GP=2)  was  applied, 
meaning  that  the  Riemann  solver  was  invoked  twice  for 
each  edge. 

The  exact  solution  was  kept  fixed  at  the  upper,  lower 
and  left  boundaries  of  the  computational  domain  while 
the  reconstructed  solution  in  the  interior  cells  at  the  right 
boundary  was  copied  into  the  boundary. 

The  quadratic  reconstruction  of  the  numerical  solution 
as  well  ■”  the  mesh  is  shown  in  fig.  51  while  the  linear 
interpolation  of  the  numerical  solution  on  the  dual  mesh 
is  presented  in  fig.  52.  These  t  wo  figures  can  be  compared 
with  fig.  42  resp.  43.  A  three  dimensional  view  of  the 
reconstruction  of  the  obtained  numerical  solution  is  given 
in  fig.  53  which  can  be  compared  with  fig,  44, 

It  is  seen  that  the  discontinuity  is  captured  in  one  cell 
and  that  no  oscillations  occur.  This  is  put  in  evidence  in 
fig.  54  and  55  shoving  a  cut  of  the  solution  at  x  =  0.3 
resp.  t  =  0.95. 


The  solution  shown  here  was  obtained  after  100  Euler 
explicit  time  steps  with  a  CF /.-number  equal  to  0.1.  The 
convergence  history  is  given  in  fig.  56  showing  the  RM  S- 
valuc  over  the  whole  domain  of  the  relative  change  of  the 
solution  values  over  each  time  step.  The  total  CPU  cost 
of  the  calculation  was  about  3  hours  on  an  SGI  Iris  4D/35 
workstation  which  means  about  0.16  seconds  per  cell  per 
iteration. 

8.4  Ringleb  Flow  in  a  Bended  Channel 

The  results  presented  in  this  subsection  were  computed 
using  a  fixed  stencil  and  are  to  be  considered  as  prelinii- 
nar  in  particular  with  respect  to  the  boundary  condition 
treatment. 

8.4.1  Problem  Description 

The  Ringleb  flow  [41]  is  basically  a  potential  flow  in  a 
strongly  bended  duct  and  can  be  described  analytically. 
A  proper  choice  of  the  shape  of  the  duct  guarantees  that 
the  flow  remains  everywhere  subsonic,  sec  fig.  57  taken 
from  [42]. 

At  any  point  in  the  domain,  the  flow  is  characterized 
by  two  parameters  q  and  k.  The  parameter  q  is  a  non¬ 
dimension  alized  speed  and  is  constant  along  circles  which 
therefore  become  the  isotach  lines.  The  parameter  k  is 
equal  to  where  6  is  the  angle  of  the  local  flow  direc¬ 
tion.  One  can  proof  that  A:  is  a  constant  along  stream¬ 
lines. 

Taking  now  the  streamline  with  k  =  0.8  as  the  inner  wall 
of  the  bended  duct  will  result  in  a  fully  subsonic  flow.  The 
outer  wall  coincides  with  the  streamline  for  which  k  — 
0.4.  The  in-  and  outlet  of  the  duct  arc  pavt  of  the  isotach 
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Fig.  51: 


Reconstruction  of  the  computed  solution  of  the  nonlinear  scalar  con-  f 

vcction  problem,  (fc=2,  GP=2,  GC-SSG).  j 


Fig.  52: 


Linear  interpolation  of  the  computed  solution  of  the  nonlinear  scalar 
convection  problem,  (i— 2,  GP=2,  GC-SSG). 


i 
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Fig.  54:  Solution  cut  U  t  =  0.3  (k=2,  GP— 2,  GC- 
SSG). 
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Fig.  55:  Solution  cut  at  *  =  0.95  (fc=2,  GP=2,  GC- 
SSG). 


line  with  q  =  0.3.  The  mesh  usc<l  on  this  geometry 
is  depicted  in  fig.  58  and  contains  about  600  triangular 
cells  generated  by  a  code  written  by  J.D.  Muller  at  the 
University  of  Michigan  and  which  uses  a  combination  of 
Delaunay  and  frontal  methods,  set'  ref.  [43],  The  flow 
turns  from  left  to  right. 

8.4.2  Scheme 

A  fixed  central  support  was  used  in  the  calculation  be¬ 
cause  of  its  enhanced  accuracy  in  the  reconstruction 
phase  (see  (16])  and  because  of  the  absence  of  discon¬ 
tinuities  in  the  flow.  The  solution  in  the  form  of  conser¬ 
vative  variables  was  rcconstruc'  cd  using  a  vector  recon¬ 
struction;  i.e.  all  variables  use  one  and  the  same  sup¬ 
port.  The  values  of  the  second  order  Zero- Mean  recon¬ 
struction  on  both  sides  of  each  cell  edge  were  passed  into 
Van  f.eer’s  flux  splitter  to  evaluate  the  (non-linear)  flux 
through  the  current  edge  using  one  point  (GP=1)  Gauss 
quadrature.  The  CFI.-numbcr  is  0.2  combined  with  a  4 
step  Kungc-KuUa  time  stepping  with  the  following  coef¬ 
ficients  :  (i,  i,  l)  . 

8.4.3  Boundary  Conditions 

The  components  of  the  unit  vector  tangent  to  the  ex¬ 
actly  known  flow  direction  are  imposed  at  the  inlet.  Note 
that  these  components  vary  along  the  inlet  .  Besides  this, 
also  the  total  pressure  (1.2  bar)  and  the  total  temper- 
atii'o  (298  K)  arc  needed  in  order  to  treat  a  subsonic 
inlet  boundary  condition.  Remark  that  the  flow  is  in- 
viscid  and  subsonic  as  a  consequence  of  which  there  arc 
no  losses.  Therefore,  the  total  pressure  and  temperature 
can  be  assumed  constant  along  the  inlet.  Zero  normal 
velocity  is  required  at  the  solid  walls.  Although  this  con¬ 
dition  is  sufficient  to  perforin  a  correct  boundary  condi¬ 
tion  treatment,  also  a  constant  total  pressure  (1.2  bar) 
was  imposed  on  the  walls.  This  type  of  treatment  was 
proposed  by  Denton  in  [44]  and  leads  to  a  more  accurate 
solution  at  the  solid  boundaries  when  a  fully  reversible 
or  isentropic  flow  is  present.  As  the  speed  is  constant  at 
the  outlet  as  well  as  the  total  pressure,  r.  constant  static 
pressure  (1.125  bar)  was  applied  at  the  outlet.  This  pres¬ 
sure  corresponds  with  the  imposed  total  pressure  at  the 


inlet  and  with  the  parameter  q  =  0.3. 

8.4.4  Solution 

The  exact  solution  presented  on  the  mesh  of  fig.  58  using 
a  3rd  order  reconstruction  (Is— 3)  is  given  in  figures  59 
and  60 

The  velocity  lines  obtained  from  the  solution  are  shown 
in  the  plot  of  fig.  61. 

It  is  clear  that  the  tachlincs  arc  no  perfect  circles  in  the 
entire  domain.  However,  the  plot  of  constant  density  lines 
given  in  fig.  62  is  comparable  to  the  solution  presented 
by  Barth  in  fig.  63  obtained  from  ref.  [45], 

A  further  comparison  between  the  density  isolincs  of  the 
numerical  solution  and  the  exact  solution  (fig.  62  and  59) 
shows  a  rather  good  agreement.  The  relative  /,j-norm  of 
the  error  as  computed  with  eq  (96)  between  the  numer¬ 
ical  solution  and  the  exact  one  over  the  whole  domain  is 
of  order  1%  for  the  density.  Fig.  64  presents  the  local 
relative  7,j-norm  of  the  error. 

The  largest  errors  are  made  at  the  inner  wall  of  the  duct 
where  the  higher  order  derivatives  of  the  exact  solution 
are  higher  and  hence  cannot  be  ma'.chcd  exactly  by  a 
quadratic  reconstruction.  Another  reason  is  that  the 
exact  boundary  is  approximated  by  a  series  of  straight 
boundary  edges  of  more  or  less  constant  length.  This  ap¬ 
proximation  is  of  course  less  accurate  where  the  curvature 
of  the  boundary  is  higher. 

To  shew  the  improvement  achieved  with  quadratic  re¬ 
construction  with  respect  to  a  HOR-schcmc  using  linear 
reconstruction,  fig.  65  shows  the  tachlincs  obtained  when 
using  linear  reconstruction  and  1  Gauss  point. 

The  underlying  incsli  is  much  more  visible  in  the  ease  of 
linear  reconstruction.  The  relative  /.j-norm  of  the  den¬ 
sity  error  ove,  the  entire  domain  is  about  1.5%  compared 
to  the  1.0%  obtained  with  quadratic  reconstruction. 
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A  Proofs  of  Higher  Accuracy 

Proof  of  Theorem  5.1  (A): 

To  proof  order  hk  space  accuracy  one  has  to  investigate 
the  local  truncation  error.  The  truncation  error  is  un 
par*  in  fhr  -nuivalcnt  differential  equation  that  is  not 
present- in  the  analytical  modelling  equation  to  be  solved. 
To  obtain  the  equivalent  differential  equation,  the  un¬ 
knowns  in  the  stencil  cells  of  the  employed  scheme  arc 
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developed  in  a  Taylor  series  around  some  point  fP  as¬ 
signed  to  the  cell  where  the  truncation  error  is  to  be 
evaluated,  see  [30]  and  [1'2].  The  differential  equation  is 
tV  equation  which  is  satisfied  by  the  exact  solution  q  in 
a  discrete  set  of  points  rp  around  which  the  Taylor  ex¬ 
pansions  have  been  made.  As  yet,  the  choice  of  the  point 
rp  remains  arbitral  .'. 

In  this  pioof,  the  procedure  deviates  a  little  front  the 
classic  or.c  given  in  [30]  and  [12]  in  that  it  first  usca  the 
theorems  4.1  and  4.3  before  applying  Taylor  series  expan¬ 
sions.  Thus,  uung  eqs.  (92)  or  (93)  and  assuming  that 
the  excct  solution  a  of  the  equivalent  differential  equation 
varies  smoothly,  the  space  discretization  of  scheme  (101) 
can  then  be  rewritten  as: 

[l>(S3q)]  (?)  = 

^■EE{4.[{(ro)+0(k‘+l)] 

+  '»;B-[9(ro)  +  0(A*'l‘)j}.Aso 

which,  m  Ls  turn,  is  equal  to 

[*>(*#1)]  («)  = 

“ (<i.nj>n)  ?(ro).Aao  +  0(/t')f>J+1 

p  n  o 

The  value  of  q(ro)  can  now  be  replaced  by  a  Taylor  series 
expansion  around  the  reference  point  rp  while  truncating 
it  aRer  the  the  terms  of  order  k\ 


[/)(«.  V?)j  (?)  («.«pb)  • 

*  1 

qp  +  ^2  jic? (?o).*ir  -Aso 
.  i=' 

+  C(hk)  7>i+1 


where  the  definition  of  Of  is  given  in  cq.  (90)  while  the 
one  of  r)q /  is  simitar  to  dQf  as  found  in  cq.  (69)  but  now 
applied  to  the  exact  solution  </  of  the  numerical  scheme. 

As  ( a.npp )  is  constant  along  each  straight  edge,  the 
previous  expression  for  the  space  discretization  can  be 
rewritten  as  a  continuous  contour  integral  if  the  number 
of  Gauss  points  O  on  the  edges  is  larger  than  or  equal  to 

2  : 

[/>(d.tf,)|  (,)  = 

if  *  ] 

Or ‘Ten  qi>  +  ^  ^!C'  ds 


+  0(hk)DkSl 


[D(a.^?)J  (?)  =  ^  (SM).q.ds  +  0(hk)'Dk+> 

Using  Green's  theorem,  one  finds  finally  that  the  space 
discretization  satisfies  the  following  equation: 

|/)(a.tf?)]  (?)  =  <  a.tf?  >r  +  0(hk) 2>J+1  (165) 
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which  completes  the  first  proof  for  order  hk  space  accu¬ 
racy.  Note  also  that  one  is  discretizing  mean  values  over 
a  cell  even  when  the  GC-IIOR -Algorithm  is  used.  1‘oir.t 
values  arc  discretized  only  if  the  order  of  the  reconstruc¬ 
tion  k  <  1. 

D 


Proof  of  Theorem  5.1  (B): 

The  second  proof  follows  rigorously  the  classic  truncation 
error  analysis  as  described  in  [30]  and  [12],  This  means 
that  all  solution  values  in  thejsupport  cells  have  to  he  ex¬ 
panded  as  well  in  the  vector  Q  or  Q  as  defined  in  eq.  (30) 
resp.  (80). 

The  proof  will  only  be  given  here  for  the  scheme  us¬ 
ing  ZM-HOR  extrapolation  since  the  reasoning  fo'  the 
GC-IIOR  scheme  is  completely  similar  and  leads  to  the 
same  conclusions.  The  proof  will  rely  heavily  cn  the 
properties  of  the  reconstruction  weights  as  expressed  in 
cqs.  (6-1),  (03),  (81)  and  (83). 

To  examine  the  local  truncation  error,  the  expres¬ 
sions  (104)  and  (105)  have  to  be  developed  in  a  Taylor 
scries  and  plugged  into  eq.  (101).  The  Taylor  expansions 
of  the  mean  of  the  solution  in  cell  P  and  a  cell  R  next  to 
cell  P  are  given  by  the  expressions  (71)  resp.  (72).  The 
reference  point  rp  in  these  expressions  is  the  same  as  the 
one  used  to  compute  the  inertial  moments  when  recon¬ 
structing  the  solution,  eq.  (33).  The  properties  expressed 
by  eq.  (64)  can  then  immediately  be  exploited  when  ap¬ 
plying  eq.  (71)  to  the  scheme  given  by  eq.  (101)  with  g 
the  exact  solution  of  the  equivalent  differential  equation: 


I'Cn  [?'■  +  E  jic'1  Col-aiF 

L  +  E  Aif'W 
+  e 

+  0(h‘)v;*‘  (16?) 

Keeping  in  mind  eq.  (64),  it  is  desirable  to  rewrite  rn 

in  terms  of  lRt,R  with  0  <  j  <  l  and  0  <  i  <  in  and  R.'  a 
cell  belonging  to  the  support  of  cell  R.  This  can  be  done 
using  the  next  transformation  formula: 


,H',P 
m.m 

t  —  r>  •  tn 


j=0 1=0  X  /  v  / 


(168) 


Using  eq.  (168)  and  taking  i\to  account  the  properties 
given  by  eq.  (64),  one  finds  after  some  manipulations 
that: 

With  these  results  the  space  discretization  becomes: 


’’  n  o 

{•;■«  [?.-+E  ^ 

+n7ii  [./I-  •  E  Tr1! ,f  -*1 1 
L  <=i 

+  w*-(s<-  +  EKP-<)-*"]} 

+  0{hk)Pk+l  (166) 

where 


K*>]<»>-?feEX>o. 


{«!.«• 


*»>>«•  qp  +  rCi  (ro)  d<li 


i=i 


+  apn' 


Qp  +  Y2  7iCf  (,ro)-d< »r } 

L  1=1  J 


-K?(h*)f>$+1 


+0(h*)p; 


k\nk* 1 


+  E  jjc'  if 


.A  so 


<ip  =  [qp  ••■<?*>] 


and 


p  p 


I 


( 


lt 


r 


with  W,  a  support  roll  «>f  till-  direct  neighbour  S  of 

cell  Using  now  the  properties  (64)  of  the  reconstruc¬ 
tion  ,hts,  eq.  (166)  becomes: 


Again,  as  ( a.npn )  is  constant  along  each  straight  edge, 
the  previous  expression  for  the  space  discretization  can  be 
rewritten  as  a  continuous  contour  integral  if  the  number 
of  Gauss  points  O  on  the  edges  is  larger  than  equal  to 


[°<SM  {q)^<bLP(Sii)' 

k  i 

qp  +  7icf ■rf* 

»««  ' 

+  0(hk)Vk+l 


[^(“■^)]  M  =  (hEE*s°- 


R  O 


From  now  on,  the  reasoning  proceeds  in  exactly  the  same 
way  as  in  the  first  proof  leading  to  cq.  (165). 

Q 
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B  First  Order  Accuracy 

Theorem  B.l 

First  order  accuracy  can  be  obtained  on  irregular  meshes 
when  only  cell- wise  constant  solution  reconstruction  is 
used. 

Proof  of  Theorem  B.l: 


As  the  scheme  is  only  stable  if  the  CFL-nutuber  is  smaller 
than  one,  it  follows  that  At  =  O(h).  Hence,  all  coeffi¬ 
cients  of  the  higher  order  derivatives  are  O(h)  and  one 
gets: 


1 


it* 

P 


fpn.Aso 


.  +  O(h) 


Let  us  first  write  down  the  truncation  error  Tp  in  cell  P 
at  time  level  n  for  the  linear  scheme  given  by  cq.  (101) 
where  now  =  qp  and  Q^q  —  </h  with  q  the  exact 

solution  of  the  equivalent  diffcrentiitl  equation: 


1 


sU 

P  ~ 


+ 


+ 


Fapn  4-  Axpp.  qx\ p  -f  Aypp.  Vy| p 

+  ^At'pr.  q*x\'p  +  -Aj/p/i-  4yy| '/» 

+  Axpn.Aypii.  fTyl'p]  }  .A so 

0(h 2) 


From  eq.  (97)  it  follows  that: 


*1  ’/> 
vx.r;. 
9y«l" 

«..|J 


-«  <h\'p  ~b  qy\'p 
U .  Vx.*|p  —  b.  9-rylp 

—  <1.  ?Xy|’/>  -  b.  <}yy|p 
a  ?l»|p  +2 ab.  9xyl;i  T  b  ■  Vyy||, 


and  furthermore: 


EE(  ar/r9r  +  npn-?p)  -&so 


u  o 


VP-  EE  ii.iipitA.il 


n  o 


—  q'p.  d  a.ii.ds 
JlKlr 


=  0 


so  that  'Ip  becomes: 


Knowing  that 

^  E  (“-urn)  .rpo.Aso  —  « 
lp  R  o 

if  the  number  of  Gauss  points  is  at  least  equal  to  1  and 

since 

rpR  =  rpo  —  Fro 
Tp  can  be  rewritten  as: 


tj:  = 


+ 


1 

uP 


[EE<  UpR.CRo  -f  Opa-fpo)  A  so 


O(h) 


tl 

P 


Taking  into  account  that 

ln  i*» 

tfy-  =  +^)(/i) 

I  n  Ip 

one  finds 


-  j^EE  ["p/r-nro  Vq 
1  n  o 

+  a^n.rpo.  -Aso- 

+  O(h) 

Delining 

tp  =  rpo ■ 
t]l  =  rno  ■ 
tp  =  O(h) 


( r  — 


■  vx!’,*. 

•  t*lp 
2  A/ 


~  E  E  npn-Azpn.Aso  -  « 

r  II  o 

?rr  E  E  “p* *»rn -A,°  ~ 6 

‘no 

^p-  E  E  arn&rrR&l,o  +  « 

i  ~1’  H  O 

iJKZ  E  E  *PR  &yPR  &»0  +  b2  — 

~  9  it  O 

—  E  E  “PH-ArpH  AypR.Aso 


•  tvvlp 


-fa&.Al]  .  fliy|£ 


+  CJ(hJ)  +  C7(AI) 


Tp  can  be  regarded  iis  the  space  discretization  of  7  in¬ 
stead  of  q  plus  a  term  of  order  h: 


Tp  = 

“  E  E  [“p*^p  +  arn -^k]  *o 

1  R  O 

+C>(h) 

=  d  (t?)  -i-  7;:  (iso) 

The  global  error  of  the  .Jicme  of  «q,  \  jOl)  is  now  tielined 
as: 


Ep  =  q(rp,  <n)  -  q'i- 

with  q  the  exact  solution  of  the  analytical  problem  (97) 
and,  due  to  the  linearity  of  the  scheme,  the  global  error 
satisfies: 
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Tp  = 

+  jj-  52  52  +  a p ,(.£'!()  .Ajo  (170) 

ll/>  /t  o 

or 

-£P)4I>(B?)=Tp  (171) 

The  global  error  can  be  decomposed  as 

E",  =  Kr  +  Ep  (172) 

where 

E',',  =  'Jf,  ^  rro.  fyj"  =  0(1.) 

r»i  —  r**»  l''»» 

C/p  —  IV  |>  “  Li  p 

Equation  (171)  becomes  then: 


-  ft! +£>(») -Cl(0 

Since  the  scheme  is  stable,  one  linds  that  the  error  Ep 
remains  O(h)  over  a  fixed  time  interval.  It  follows  then 
that  the  global  error  E ",  is  O(h).  The  scheme  given  by 
cq.  (101)  is  therefore  first  order  accurate  in  space  and 
time. 
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FOREWORD  AND  INTRODUCTION 

Numerical  methods  for  the  solution  of  field  problem- 
s  using  unstructured  grids  have  reached  a  High  de¬ 
gree  of  maturity.  In  Computational  Fluid  Dynamics 
(CFD),  their  impact  has  come  relatively  late.  CFD 
has  been  traditionally  dominated  by  structured  grid 
solvers.  These  simpler  solvers  were  used  on  relatively 
simple  domains  that  were  still  of  engineering  inter¬ 
est,  e.  .  an  airfoil  or  a  wing.  Only  as  the  computa¬ 
tional  power  of  hardware  increased  to  current  levels 
did  the  possibility  of  computing  geometrically  com¬ 
plex  designs  become  a  reality.  It  was  at  this  point 
that  methods  based  on  unstructured  grids  started  to 
have  an  impact  on  mainstream  CFD. 

The  following  set  of  notes  are  part  of  a  larger  set  that  I 
had  originally  planned  for  this  short  course.  As  the  fi¬ 
nal  program  only  called  for  me  to  describe  grid  gener¬ 
ators,  adaptive  refinement  schemes,  visualization  and 
parallelization  issues,  I  have  restricted  them  to  these 
topics.  The  topic  of  visualization  has  been  left  out 
entirely .  Given  the  description  of  optimal  data  struc¬ 
tures  for  grid  generation,  the  reader  can  easily  devise 
optimal  search  algorithms  for  visualization. 

I  have  preceded  these  topics  with  a  short  chapter  that 
places  CFD  among  related  disciplines,  tries  to  define 
its  aims,  and  focuses  on  the  end-product  of  CFD  re¬ 
search:  flow  simulation  codes.  The  newcomer  should 
find  this  section  particularly  interesting. 

The  reference  section  covers  all  topics  originally  in¬ 
tended  for  this  course.  The  reader  is  encouraged  to 
consult  all  of  these  for  a  more  complete  picture  of 
CFD. 

1.  CFD:  GENERAL  CONSIDERATIONS 

Before  going  into  a  detailed  description  of  algorithm- 
s  used  for  Computational  Fluid  Dynamics  (CFD),  it 
9eema  proper  to  place  this  discipline  among  related 
disciplines.  CFD  is  part  of  Computational  Mechanics, 
which  in  turn  is  part  of  Simulation  Techniques.  The 
aim  is  to  approximate  physically  relevant  situations 
and  phenomena  using  computers.  In  CFD,  this  is  ac¬ 
complished  by  solving  numerically  Partial  Differential 
Equations  (PDEs),  or  by  following  the  interaction  of 
a  large  numbers  of  particles.  Due  to  its  relevance  to 


the  aerospace  industry,  as  well  as  to  most  manufac¬ 
turing  processes,  CFD  has  been  pursued  actively  ever 
since  the  first  digital  computers  were  developed.  The 
Manhattan  project  was  a  major  testbed  and  benefi¬ 
ciary  of  early  CFD  technology.  Concepts  like  artificial 
dissipation  date  from  this  time. 

CFD,  by  its  very  nature,  encompasses  a  variety  of 
disciplines,  which  may  be  enumerated  in  the  following 
order  of  importance: 

-  Engineering:  We  live  in  a  technology-driven 
world.  Engineering  provides  the  reason  why  we 
pursue  CFD.  Forget  the  romantic  vision  of  re¬ 
searchers  mimicking  art  for  art's  sake.  This  is 
engineering,  and  if  a  code  can  not  guide  an  engi¬ 
neer  to  better  products,  it  r  simply  useless. 

-  Physics:  Physics  explains  the  phenomena  to  be 
simulated  for  engineering  purposes,  and  provides 
possible  approximations  and  simplifications  to  ab 
initio  physics.  For  example,  the  potential  ap¬ 
proximation,  where  applicable,  represents  CPU 
savings  of  several  orders  of  magnitude  as  com¬ 
pared  to  full  Reynolds- Averaged  Navier-Stokes 
(RANS)  simulations.  It  is  the  task  of  this  dis¬ 
cipline  to  outline  the  domains  of  validity  of  the 
different  assumptions  and  approximations  that 
are  possible. 

*  Mathematics:  Mathematics  has  three  different 
types  of  input  for  CFD  applications.  These  are: 

a)  Classic  Analysis,  which  discusses  the  nature, 
boundary  conditions,  Green  kernels,  under¬ 
lying  variational  principles,  adjoint  opera¬ 
tors,  etc.  of  the  PDEs; 

b)  Numerical  Aualysis,  which  describes  the  sta¬ 
bility,  convergence  rates,  uniqueness  of  solu¬ 
tions,  well-posedness  of  numerical  schemes, 
etc.;  and 

c.)  Discrete  Mathematics,  which  enables  the 
rapid  execution  of  arithmetic  operations  (try 
solving  a  square-root  by  hand). 

-  Computer  Science:  Computer  science  has  mush¬ 
roomed  into  many  subdisciplints.  The  most  im¬ 
portant  ones  for  CFD  are: 

a)  Algorithms,  which  describe  how  to  perform 
certain  operations  in  an  optimal  way  (e.g. 
search  of  items  in  a  list  or  iu  space); 

b)  Coding,  so  that  the  final  code  is  portable, 


easy  to  modify  and/or  expand,  easy  to  un¬ 
derstand,  user-friendly,  etc.; 

c)  Software,  which  not  only  encompasses  com¬ 
pilers,  debuggers  and  operating  systems,  but 
also  advanced  graphics  libraries  (c.g.  try  do¬ 
ing  what  you  can  do  with  GL  in  PHIGS): 
and 

d)  Hardware,  which  drives  not  only  the  realm 
of  ever  expanding  applications  that  would 
have  been  unthinkable  a  decade  ago,  but  also 
iutluences  to  a  large  extent  the  algorithms 
employed  and  the  way  codes  are  written. 

-  Visualization  Techniques:  The  vast  amounts  of 
data  produced  by  modern  simulations  need  to  be 
displayed  in  a  sensible  way.  This  not  only  refers 
to  optimal  algorithms  to  filter  and  traverse  the 
data  at  hand,  but  also  to  ways  of  seeing  this  da¬ 
ta  (plane-cuts,  iso-suri'aces,  X-rays,  stereo- vtsion, 
etc.). 

-  User  Community:  The  final  product  of  any  CFD 
effort  is  a  code  that  is  to  be  used  for  engineer¬ 
ing  applications.  Successful  codes  tend  to  have 
a  user-community.  This  introduces  human  fac¬ 
tors  which  have  to  be  accounted  for:  confidence 
and  benchmarking,  documentation  and  educa¬ 
tion,  the  individual  motivation  of  the  end-users, 
ego-factors,  not-invented-here  syndrome,  etc 

1.1  The  CFD  Code 

The  end-product  of  any  CFD  effort  is  a  code  that  is 
to  be  used  for  engineering  applications.  The  quality 
of  this  tool  will  depend  on  the  quality  of  ingredients 
listed  above.  Just  as  a  chain  is  only  ns  strong  as  it- 
s  weakest  member,  so  is  a  code  only  as  good  as  the 
worst  of  its  ingredients.  Given  the  breadth  and  vari¬ 
ety  of  disciplines  required  for  a  good  code,  it  is  not 
surprising  that  only  a  few  codes  make  it  to  a  pro¬ 
duction  environment,  although  so  many  are  written 
worldwide.  Once  a  CFD  code  leaves  the  realms  of  re¬ 
search,  it  becomes  a  tool,  i.e.  a  part  of  the  service  in¬ 
dustry.  CFD  codes,  like  other  simulation  codes,  have 
certain  properties.  Some  of  these  arc: 

-  EU:  Ease  of  Use  (Problem  set-up,  User  interface. 

-  DO:  Documentation  (Manuals,  Help,  .) 

-  GF:  Geometric  Flexibility 

-  'FT:  Turnaround  Time  (Set-up  to  end-result) 

-  DM:  Benchmarking 

-  AC:  Accuracy 

-  SP:  Speed 

As  any  other  product,  CFD  codes  have  a  customer 
base.  This  customer  base  inay  be  categorized  by  the 
number  of  times  a  certain  application  has  to  be  per¬ 
formed.  Three  main  types  of  end-users  may  be  iden¬ 
tified: 

a)  Those  that  require  a  few  runs  on  new  configura¬ 
tion*  every  so  often  to  guide  them  in  their  designs 


(the  general  purpose  run  of  manufacturing  industries 
and  process  control); 

l>)  Those  that  require  a  large  number  of  runs  to  op¬ 
timize  highly  sophisticated  products  (e.g.  airfoil  or 
wing  optimization);  and 

c)  Those  that  require  a 

few  very  detailed  runs  on  extremely  simple  geome¬ 
tries  in  order  to  understand  or  discover  new  physics 
(the  NASA/NRL/LLNL/LANL/ctc.  scenario,  where 
each  run  takes  at  least  500  hours  of  CRAY-tinw). 

According  to  this  frequency  of  runs,  the  priorities 
change,  as  can  be  seen  from  the  following  table: 


Gen. Purp. /Anal.  Des./Optim.  New  Phys. 


0(1) 

0(1,000) 

O(I0) 

days 

seconds 

months 

EU 

SP 

AC 

DO 

TT 

BM 

GF 

GF 

SP 

TT 

AC 

TT 

BM 

BM 

EU 

AC 

EU 

GF 

SP 

DO 

DO 

The  message  is  clear:  before  comparing  codes,  ask 
how  often  the  code  is  to  be  used  on  a  particular  ap¬ 
plication,  how  qualified  the  personnel  is,  what  the 
maximum  allowed  turnaround  time  is,  the  expected 
accuracy  and  the  resources  available.  Only  then  can 
a  proper  choice  of  codes  be  made. 

1 .2  Porting  Research  Codes  to  an  Industrial  Context 

Going  from  a  research  code  to  an  industrial  code  re¬ 
quires  a  major  change  of  focus.  Industrial  codes  arc 
characterized  by: 

-  Extensive  manuals  and  other  documentation; 

-  24-hour  hotline  answering  service; 

-  Customer  support  team  for  special  request¬ 
s/applications; 

-  Incorporation  of  changes  through  releases  and 
training. 

In  short,  they  require  an  organization  to  support 
them.  Many  researchers  (particularly  members  of  a- 
cademia  who  always  longed  for  their  own  company) 
seem  to  ignore  this.  The  result  is  a  proliferation  of 
small  companies  that  neither  satisfy  customer  needs, 
uor  attain  a  high  scientific  level  in  their  cedes.  Thus, 
these  companies  are  characterized  by  being  short¬ 
lived,  as  well  as  being  one-idea  or  one-product  ori¬ 
ented. 

2.  UNSTRUCTURED  GRID  GENERATION 

Consider  the  task  of  generating  an  arbitrary  unstruc¬ 
tured  mesh  in  a  given  computational  domain.  The 
information  required  to  perform  this  task  is* 


a)  A  description  of  the  bounding  surfaces  of  the  do¬ 
main  to  be  discretized; 

b)  A  description  of  how  the  element  size,  shape  and 
orientation  should  be  in  space; 

c)  The  choice  of  element  type; 

d)  The  choice  of  a  suitable  method  to  achieve  the 
generation  of  the  desired  mesh, 

The  most  common  ways  to  provide  these  four  pieces 
of  information  are  discussed  in  the  following. 

2.1  Description  of  the  Bounding  Surfaces  of  the 

.Domain 

5.1  Analytic  Functions:  This  is  the  preferred  choice 
if  a  CAD-CAM  data  base  exists  for  the  description  of 
the  domain.  In  this  case,  Splines,  13-Splines  or  other 
types  of  functions  ate  used  to  describe  the  surface 
of  the  domain.  An  important  characteristic  of  this 
approach  is  that  the  surface  is  continuous,  i.e.  there 
exist  no  ‘holes’  in  the  information. 

5.2  Discrete  Data:  Here,  instead  of  functions,  a  cloud 
of  points  describes  the  surface  of  the  computational 
domain.  This  choice  may  be  attractive  when  no  CAD- 
CAM  data  base  exists.  Commercial  digitizers  can 
gather  surface  point  information  at  high  speeds,  (> 
30,000  points/eec),  allowing  a  very  accurate  descrip¬ 
tion  of  a  scaled  model  or  the  full  configuration  [Gd.l). 
Notice  that  this  approach  leads  to  a  discontinuous 
surface  description.  In  order  r.ot  to  make  any  mis¬ 
takes  when  discretizing  the  surface  during  mesh  gen¬ 
eration,  only  the  points  given  in  the  cloud  of  points 
should  be  selected. 

2.2 

Variation  of  Element  Size.  Shane  and  Orientation 
in  Space 

V. !  Internal  Measure  of  Grid  Quality:  The  idea  here 
is  to  start  from  a  given  surface  mesh.  After  the  intro¬ 
duction  of  a  new  point  or  element,  the  quality  of  the 
current  grid  or  front  is  assessed.  Then,  a  new  point 
or  element  is  introduced  in  the  most  critical  region. 
This  process  is  repeated  until  either  a  mesh  that  sat¬ 
isfies  a  preset  measure  of  quality  is  achieved  (Gv.ll), 
or  the  number  of  faces  in  the  front  has  shrunk  to  ze¬ 
ro  (Ga.flj.  This  technique  works  well  for  equilateral 
elements,  requiring  minimal  user  input.  On  the  other 
hand,  it  is  not  very  general,  as  the  surface  mesh  needs 
to  be  provided  as  part  of  procedure. 

V.2  Analytical  Function^:  In  this  case,  the  usee  codes 
in  a  small  subroutine  the  desired  variatio  *.  of  elemcn- 
t  size,  shape  and  orieutation  in  space.  Needless  to 
say,  this  is  the  least  general  of  all  procedures,  requir¬ 
ing  new  coding  for  every  new  problem.  On  the  oth¬ 
er  hand,  if  the  same  problem  needs  to  be  discretized 
many  times,  an  optimal  discretization  may  be  coded 
iu  this  way.  Although  it  may  seem  inappropriate  to 
pursue  such  an  approach  within  unstructured  grids, 
the  reader  may  be  reminded  that  moat  current  airfoil 
calculations  are  carried  out  using  this  approach. 

V.3  Boxes:  If  all  that  is  required  are  regions  with 


uniform  mesh  sizes,  one  may  define  a  scries  of  box¬ 
es  in  which  the  clement  size  is  constant.  For  each 
location  in  space,  the  element  size  taken  is  the  small¬ 
est  of  all  the  boxes  containing  the  current  location. 
When  used  in  conjunction  with  surface  definition  via 
quad/octrees,  one  can  automate  the  point  distribu¬ 
tion  process  completely  in  a  very  elegant  way  (Go.l). 

V .4  I\int/ Line/Surface  Sources:  A  more  flexible  way 
that  combines  the  smoothness  of  functions  with  the 
generality  of  boxes  or  other  discrete  elements  is  to  de¬ 
fine  sources.  As  an  example,  consider  the  line  source 
defined  by  the  function: 

«{x)*«o[l+(^p-f2)7]  .  (2.1) 

The  definition  of  r(x)  may  be  inferred  from  Fig¬ 
ure  2.1.  As  one  can  see,  a  very  flexible  way  of  defin¬ 
ing  mesh  spacings  is  obtained  with  the  4  input  pa¬ 
rameters  $o(ro,ri,7.  If  one  collapses  the  two  points 
xj  -♦  xi ,  a  point-source  is  obtained.  It  is  a  simple 
matter  to  introduce  these  sources  interactively  with 
the  mouse  once  the  surface  data  is  available.  Ob¬ 
viously,  the  number  of  sources  should  be  kept  small 
{N,  <  50)  in  order  not  to  incur  penalties  in  user  set¬ 
up  time  and  grid  generation  time. 
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Figure  2.1 :  Example  of  a  Source  to  Define 
Element  Size  in  Space 

V.5  Background  Grids:  Here,  a  coarse  grid  is  provid¬ 
ed  by  the  user.  At  each  of  the  nodes  of  this  back¬ 
ground  grid,  the  element  size,  stretching  and  stretch¬ 
ing  direction  are  specified.  While  very  general  and 
flexible,  and  particularly  suited  to  adaptive  remesh¬ 
ing,  the  input  of  suitable  background  grid*  for  com¬ 
plex  3-D  configurations  can  become  a  tedious  pro¬ 
cess.  Therefore,  moat  available  grid  generator.-  em¬ 
ploy  background  grids  in  conjunction  with  source- 
definitions  in  order  to  generate  the  first  mesh. 

2.3  Element  Type 

Almost  all  current  unstructured  grid  generators  can 
only  generate  triangular  or  tetrahedral  elements.  If 
quad-elements  in  2-D  are  required,  they  are  generated 
by  the  following  five-stage  process: 

Q.l  Generate  a  triangular  mesh  with  elements  that 


are  four  times  as  big  as  the  quad-element  *  re¬ 
quired. 

Q.2  Fuse  os  many  pairs  of  triangles  into  quads  as  pos¬ 
sible  without  generating  quads  that  are  too  dis¬ 
torted.  This  process  will  leave  some  triangles  in 
the  domain. 

Q.3  Smooth  the  mesh  of  t  riangles  and  quads 
Q.'l  H-refine  globally  tho  mesh  of  triangles  and  quads. 
For  the  triangles,  introduce  an  additional  point 
in  the  element  (see  Figure  2.2).  In  this  way.  the 
resulting  mesh  will  only  contain  quads.  More¬ 
over,  the  quads  will  now  be  of  the  desired  size. 


Figure  2.2:  Generation  of  Quad-Meshes  from  Triangles 


Q.5  Smooth  the  final  mesh  of  quads. 

The  procedure  outlined  above  will  not  work  in  3-D. 
There  is  a  large  effort  of  3-D  brick  generation  tech¬ 
nology  at  Sandra  Labs  at  the  present  time. 

2.4  Methods 

There  appear  to  be  only  the  two  following  ways  to  fill 
space  with  an  unstructured  mesh: 

M.l  Fill  Ennitv.  i.e.  Not  Yet  G ridded  Space:  The 
idea  here  is  to  procede  into  as  yet  ungridded  space  un¬ 
til  the  complete  computational  domain  is  filled  with 
elements.  This  has  been  shown  diagramatically  in 
Figure  2.3.  The  methods  falling  unaer  this  catego¬ 
ry  are  the  so-called  advancing  front  algorithms.  The 


‘front’  denotes  the  boundary  botwoou  tho  togion  in  »- 
pace  that  has  been  filled  with  elements  and  that  which 
is  empty. 


Figure  2.3:  Advancing  Front  Method 


M.2  Itpprove  an  Existing  Grid:  In  this  case,  an  exist¬ 
ing  grid  is  modified  by  the  introduction  of  new  points. 
After  tho  introduction  ofoach  point,  tho  grid  is  recon¬ 
nected  or  reconstructed  locally  in  order  to  improve 
the  mesh  quality.  This  procedure  has  been  sketched 
in  Figure  2.4.  In  most  coses,  the  Delauney  circutn- 
scircle  or  circumsphcrc  criterion  is  used  to  reconnect 
the  points.  Given  the  duality  between  Voronoi  tesse- 
lations  and  the  triangulations  obtained  using  the  Dc- 
launey  criterion,  the  methods  tailing  under  this  cate¬ 
gory  have  been  called  Voronoi  algorithms. 
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In  the  following  matrix,  a  brief  summary  is  given  of 
possible  combinations  for  specifying  element  size  and 
shape  in  space,  the  method  employed  to  generate  the 
mesh  as  attempted  by  various  authors: 
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2.1  Other  Methods 

Many  other  method*  of  generating  meshes  that  are 
specially  suited  to  a  part  icular  application  may  be  de¬ 
veloped  If  one  knows  the  approximate  answer  (say  wo 
want  to  solve  the  same  wing  time  and  time  again),  the 
specialized  development  of  an  optimal  mesh  makes 
good  sense,  in  many  of  these  cases  (e  g.  an  O-mesh 
for  a  subsonie/tronsouic  steady-state  airfoil  calcula¬ 
tion),  grids  generated  by  the  more  general  methods 
listed  above  will  tend  to  be  larger  than  the  specialized 
ones  for  the  same  final  accuracy.  The  main  methods 
falling  under  this  specialized  category  are: 

a)  Simple  Mappings.  In  this  case,  it  is  assumed  that 
the  complete  computational  domnm  can  be  mapped 
into  a  single  quad  or  cube.  The  dist-bution  of  points 
and  elements  in  space  is  controlled  either  by  an  alge¬ 
braic  function,  or  by  the  solution  oi  a  Partial  Differ¬ 
ential  Equation  in  the  transformed  space.  Needless 
to  say,  the  number  of  points  on  opposing  faces  of  the 
mapped  quad  or  cube  have  to  match  line  by  line. 

b)  Macro-Element  Approach:  Here,  the  previous  ap¬ 
proach  is  applied  on  a  local  level  by  first  manually  or 
semi-manually  disrretiri  »g  the  domain  with  large  ele¬ 
ments.  These  large  elements  are  subsequently  divided 
up  into  smaller  elements  using  simple  mappings  [G- 

—  ii  / -  _ nr\  i ..  ,v _ 

i  i^vut  4>t;j.  ui  uiv  uciubpiibc  wiuitiumt}, 

this  approach  has  been  termed  multi-block’,  and  a 
whole  service  industry  dedicated  to  the  proper  con¬ 
struction  of  grid*  has  evolved  (Gm.'i-h). 


Fjg.’.!£2-iS:  Macro-Element  or  Multi-Block  Approach 


c)  ■Uniforni.Uackgfonnd  Grid:  Tor  p.obloms  that  re¬ 
quire  uniform  grids  (e  g  IHdnr  Cross- Section  cal¬ 
culations  and  homogeneous  turbulence),  most  of  the 
mesh  covering  the  computational  domain  can  be  read¬ 
ily  obtain  from  a  uniform  grid.  This  grid  is  called 
a  background  grid,  because  it  is  laid  over  the  com¬ 
putational  domain.  At  the  boundaries,  the  mesh 
may  be  modified  to  conform  to  the  surfaces,  although 
many  Legoland  codes  omit  even  this.  After  modify¬ 
ing  these  surface  elements,  the  mesh  is  smoothed,  in 
order  to  obtain  a  more  uniform  discretization  close  to 
the  boundaries  [Gr.l.Gr.2],  The  procedure  has  been 
sketched  in  Figure  2.6. 


Figure  2-tf:  Mesh  Genera'  an  by  Perturbation  of  a 
Begu’.u'  Mesh 

2.  GRID  GENERATION  USING  THE 
ADVANCING  FRON  T  METHOD 

After  describing  the  general  strategies  currently  avail¬ 
able  to  generate  unstructured  grids,  the  advancing 
front  method  will  now  be  explained  in  more  detail. 
The  aim  is  to  show  what  it  takes  to  make  any  of 
the  methods  outlined  iu  the  previous  sections  work. 
Many  of  the  data  structures,  search  algorithms,  and 
general  coding  issues  carry  over  to  the  other  methods. 
'The  advancing  front  technique  consists  algorithmical¬ 
ly  of  the  following  steps: 

F.l  Define  the  spatial  variation  of  element  size, 
stretchings,  and  stretching  e  rections  for  the  ele¬ 
ments  to  be  created.  In  most  cases,  this  is  accom¬ 
plished  with  a  combination  of  background  grids 
and  sources  as  outlined  above. 

F.2  Define  the  boundaries  of  the  domain  to  be  grid- 
ded.  This  is  typically  accomplished  by  splines  in 
2-D  and  surface  patches  in  3-D. 

F.3  Using  the  information  stored  on  the  background 
grid,  set  up  faces  on  all  these  boundaries.  This 


yields  the  initial  front  of  faces.  At  the  same  time, 
find  the  gcnctati  u  parameters  (element  size,  el¬ 
ement  stretchings  and  stretching  directions)  for 
these  faces  from  the  background  grid. 

F.4  Select  the  next  face  to  be  deleted  from  the  fron- 
t;  in  order  to  avoid  large  elements  crossing  over 
regions  of  small  elements,  the  face  forming  the 
smallest  new  element  is  selected  as  the  next  face 
to  be  deleted  from  the  list  of  faces. 

F.j  For  the  face  to  be  deleted: 

P.5.1  Select,  n  ‘best  point.*  position  for  the  intro¬ 
duction  of  n  new  point  II’MKW. 

F .8.2  Determine  whether  a  point  exists  m  the  al¬ 
ready  generated  grid  that  should  bo  used  in 
lieu  of  the  new  point.  If  there  is  such  a  point, 
set  *his  point  to 

Il’SEW  and  continue  searchin0  (go  to  P.5.2). 


F.6.3  Determine  whether  the  element  formed  with 
the  selected  point  IPNEW  docs  not  cro53  ary 
given  faces.  If  it  does,  select  a  new  point  as 
IPHF.V  and  try  again  (go  to  F.5.3). 

P.6  Add  the  new  element,  point,  and  faces  to  their 
respective  lists. 

P.7  Find  the  gouoration  parameters  for  the  new  faces 
from  the  background  grid. 

P.8  Delete  the  known  faces  from  the  list  of  faces. 

P.9  If  there  are  any  faces  left  in  the  front,  go  to  F.4. 

The  complete  grid  generation  of  a  simple  2-D  domain 
using  the  advancing  front  technique  is  shown  in  Fig¬ 
ure  3.1.  In  the  following,  individual  aspects  of  this 
general  algorithmic  outlined  are  described  in  more  de¬ 
tail. 
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Figure  3.1:  Grid  Generation  Using  the  Advancing  Front  Technique 


N-7 


31.  Checking  the  Intersection  of  Faces 

The  most  important  ingredient  of  the  advancing  front 
generator  is  a  reliable  and  fast  algorithm  for  checking 
whether  two  faces  intersect  each  other.  Experience 
from  practical  applications  indicates  that  even  slight 
changes  in  tLis  portion  of  the  generator  greatly  influ¬ 
ence  the  final  mesh.  As  with  so  many  other  problem- 
s  in  computational  geometry,  checking  whether  two 
faces  intersect  each  other  seems  trivial  for  the  eye, 
but  is  complicated  to  code.  The  problem  is  shown 
in  Figure  3.2.  The  checking  algorithm  is  based  on 
the  following  observation:  two  triangular  faces  dc  not 
intersect  if  no  side  of  either  face  intersects  the  other 
face.  The  idea  then  is  to  build  all  possible  side-face 
combinations  between  any  two  faces  and  check  them 
in  turn.  If  no  intersection  is  found,  then  the  faces  do 
not  cross.  With  the  notation  defined  in  Figure  3.3, 


the  intersection  point  is  found  as 

x/  +a1gi  +a2gj  =  x,  +a3ga  ,  (3.1) 

where  the  gj-vectors  form  a  covariant  basis.  Using 
the  contrav&riant  basis  g*  defined  by 

g<  gi=«j  ,  (3-2) 

where  denotes  the  Kronecker-delta,  the  a*  are  given 
by 

=(*.  -*/)V  . 

«2  =  (x,  -x/)  g2  .  (3.3) 

a3  =  (x,-x.)  g3  . 


Because  we  are  only  interested  in  a  triangular  surface 
for  the  gi,  gi  -  plane,  we  define  another  quantity  sim¬ 
ilar  to  the  third  shape  function  for  a  linear  triangle: 

a4  =  1  —  a1  —  at2  .  (3.4) 

Using  the  o',  two  faces  can  be  considered  as  ‘-rosse 
if  they  only  come  close  together.  Then,  in  order  tor 
the  side  not  to  cross  the  face,  at  least  one  of  the  o’ 
has  to  satisfy 

t  >  max(— or*,a*  —  1)  ,  *  =  1,4  ,  (3.5) 

where  I  is  a  predefined  tolerance.  By  projecting  the  g, 
onto  their  respective  unit  contravariant  vectors,  one 
can  obtain  the  actual  distance  between  a  face  and  a 
side.  The  criterion  given  by  Eqn.(3.5)  would  then  be 
replaced  by  (see  Figure  3.4): 

d>  —  mar(— a'.o1  -  1)  ,t=l,4  (3.6) 

|g' I 


Figure  3.2:  Crossing  of  Two  Faces 


FACE 


Figure  3.3:  Face-Side  Combination 
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The  first  form  (Eqn.(3.5))  produces  acceptable  grids. 
If  the  face  and  the  side  have  points  in  common,  then 
the  «*  will  all  be  either  1  or  0.  As  both  Eqn.(3.5)  and 
Eqn.(3.6)  will  not  be  satisfied,  special  provision  has 
to  be  made  for  these  cases.  For  each  two  faces,  six 
side-face  combinations  are  possible.  Considering  that 
on  average  about  40  close  faces  need  to  be  checked, 
this  way  of  checking  the  crossing  of  faces  is  very  CPU¬ 
intensive.  When  it  was  first  implemented,  this  portion 
of  the  grid  generation  code  took  more  than  80%  of  the 
CPU  time  required.  In  order  to  reduce  the  work  load, 
a  three-layered  approach  was  subsequently  adopted: 

a)  Min/Max-search:  The  idea  here  is  to  disregard 
all  face-face  combinations  where  the  distance  be¬ 
tween  faces  exceeds  some  prescribed  minimum  dis¬ 
tance.  This  can  be  accomplished  by  checking  the 
maximum  amd  minimum  value  for  the  coordinates 
of  each  face.  Faces  cannot  possibly  cross  each  other 
if,  at  least  for  one  of  the  dimensions  i  =  1,2,3,  they 
satisfy  one  of  the  following  inequalities 

maz/ae.l  (xA<xB>xc)  <  ">”»/<»«*  (^  >  XB>  x<?)  _  , 

(3.7a) 

min/aetl  (xA>xB’xc)  >  m ax/oc*3  (xA,xB<xc)  +  d  < 

(3.76) 

where  A,  B,C  denote  the  corner  points  of  each  face. 

b)  Local  element  coordinates:  The  purpose  of  check¬ 
ing  for  face-crossings  is  to  determine  whether  the  new¬ 
ly  formed  tetrahedron  breaks  already  given  faces.  The 
idea  is  to  extend  the  previous  Min/Max-criterion  with 
t'ue  shape  functions  of  the  new  tetrahedron.  If  all 
the  points  of  a  given  face  have  shape-function  val¬ 
ues  that  have  the  same  sign  and  lie  outside  the 
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[-<,  1  ■+  tj  interval,  then  the  tetrahedron  cannot  pos¬ 
sibly  cross  the  face.  Such  a  face  is  therefore  disre¬ 
garded. 

c)  ln-depth  analysis  of  side-face  combinations:  Al- 
1  the  faces  remaining  after  the  filtering  process  of  steps 

a)  and  b)  are  analyzed  using  side-face  combinations 
as  explained  above. 

Each  of  these  three  filters  requires  about  an  order 
of  magnitude  more  CPU-time  than  the  preceding 
one.  When  implemented  in  this  way,  the  face-crossing 
check  requires  only  25%  of  the  total  grid  generation 
time.  When  operating  on  a  vector  machine,  loops  are 
performed  over  all  the  possible  combinations,  building 
the  g,,g\a‘,  etc.  in  vector  mode.  Although  the  vec¬ 
tor  lengths  are  rather  short,  the  chaining  that  results 
from  the  lengthy  mathematical  operations  involved 
results  in  acceptable  megaflop-rates  on  the  CRAY- 
XMP. 

3.2.  Data  Structures  to  Minimize  Search  Overheads 

The  operations  that  could  potentially  reduce  the  ef¬ 
ficiency  of  the  algorithm  to  0(Nl  i)  or  even  0(Ar2) 
are  (see  section  2): 

a)  Finding  the  next  face  to  be  deleted  (step  F.4); 

b)  Finding  the  closest  given  points  to  a  new  point 
(step  F.5.2); 

c)  Finding  the  faces  adjacent  to  a  given  point  (step 
F  5.3); 

d)  Finding  for  any  given  location  the  values  of  gen¬ 
eration  parameters  from  the  background  grid 
(steps  F.3  and  F.7).  This  is  an  interpolation 
problem  on  unstructured  grids. 

The  verb  ‘find’  appears  in  all  of  these  operations.  The 
main  task  is  to  design  the  best  data  structures  for 
performing  the  search  operations  a)-d)  as  efficiently 
as  possible.  These  data  structures  are  typically  bi¬ 
nary  trees  or  more  complex  trees.  They  were  devel¬ 
oped  in  the  1960s  for  Computer  Science  applications. 
Many  variations  are  possible  (see  [Ds.lj).  As  with 
flow  solvers,  there  does  not  seem  to  be  a  clearly  de¬ 
fined  optimal  data  structure  that  all  current  grid  gen¬ 
erators  use.  For  each  of  the  data  structures  currently 
employed,  one  can  find  pathological  cases  where  the 
performance  of  the  tree-seach  degrades  considerably. 
The  data  structures  I  have  ueed  are: 

-  Heap-lists  [Ds.1-4],  to  find  the  next  face  to  be 
deleted  from  the  front; 

-  Quad-trees  (2-D)  and  Octrees  (3-D) 
[Ds.l,2,Go.l],  to  locate  points  that  are  close  to 
any  given  location; 

-  N-trees,  to  determine  which  faces  are  adjacent  to 
a  point. 

Combining  these  data-atructuree,  one  can  also  derive 
an  optimal  interpolation  algorithm  for  unstructured 
grids  [G&.4]. 


3.2.1  Heap  List  for  the  Faces 

Heap  lists  are  well-known  binary  tree  data  structures 
in  computer  science  [Ds.1-4].  The  ordering  of  the  tree 
is  accomplished  by  requiring  that  the  key  of  any  fa¬ 
ther  (root)  be  smaller  than  the  keys  of  the  two  sons 
(branches).  An  example  of  a  tree  ordered  in  this  man¬ 
ner  is  given  in  Figure  3.5,  where  a  possible  tree  for  the 
letters  of  the  word  ‘example’  is  shown.  The  letters 
have  been  arranged  according  to  their  place  in  the  al¬ 
phabet.  We  must  now  devise  ways  to  add  or  delete 
entries  from  such  an  ordered  tree  without  altering  the 
ordering.  In  the  present  case  faces  have  to  added  as 
entries  into  the  tree.  Therefore,  replace  ‘entry’  by 
‘face’.  The  ideas  that  follow  use  the  heap-sort  and 
heap-search  algorithms  [Ds.2,3]  to  determine  quickly 
which  face  should  be  deleted  next  from  the  front. 


Figure  3.5:  Insertion  of  Items  into  the  Heap  List 


The  positions  of  the  son  or  the  father  in  the  heap 
list  LBEAP<1:MHBAP)  are  denoted  by  IPSOI,  IPFATH 
respectively.  Accordingly,  the  face-number  of  the 
son  or  the  father  in  the  tree  is  denoted  by  IF- 
SOI,  XFFATS.  Then  IFSOI=LHEAP( IPSOI)  and  IF- 
FATH*LHBAP(XFFATI).  From  Figure  3.5  one  can  see 
that  the  two  sons  of  position  IPFATH  are  located  at 
IPS0H1-  *XPFATH  and  IPS0I2=»2*IPFATHM  respec¬ 
tively.  Assume  that  IFACE  faces  and  RFACB(1  :IFACB) 
associated  keys  are  given.  The  two  main  operations 
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are  adding  and  deleting  a  new  face  to  the  tree  without 
altering  the  ordering. 

3.2. 1.1  Adding  a  new  face  to  the  heap  list 

The  idea  is  to  add  the  new  face  at  the  end  of  the 
tree.  If  necessary,  the  internal  order  of  the  tree  is  re¬ 
established  by  comparing  father  and  son  pairs.  Thus, 
the  tree  is'  traversed  from  the  bottom  upwards. 
Algorithm  ADDHEAP  : 

A.l  Increase  AHEAP  by  one:  NHEAP-IHEAP+1. 

A.2  Place  the  new  face  IFMEW  at  the  end  of  the  heap 
list:  LHEAP(IHEAP)=IFIEV. 

A. 3  Set  the  position  of  the  son  XPSOI  in  the  heap  list 
to: 

IPSOIsIHEAP. 

A.4  Then  the  position  of  the  father  XPFATH  in  the 
heap  list  is  given  by:  IPFATH-IPSOI/2  (integer 
division). 

A.5  The  faces  associated  with  the  positions  of  father 
and  son  are:  IFSOH=LHEAP(IPSO«), 
IFFATH=LHEAP  (  XPFATH  ) . 

A.6  If  RFACE(IFSOH)  <  RFACE(IFFATH): 

A. 6.1  interchange  the  faces  stored  in  LHEAP 
A.6. 2  set  IPSO*=IPFAT. 

A  6.3  unless  IPSQ3=1  (top  of  the  .it),  go  back  to 
step  A.4. 

In  this  way,  the  face  with  the  smallest  associated  key 
RFACE(IFACE)  remains  at  the  top  of  the  list  in  po¬ 
sition  LHEAP  (1).  The  process  is  illustrated  in  Fig¬ 
ure  3.5,  where  the  letters  of  the  word  ‘example’  h* 
been  inserted  sequentially  into  the  heap  list. 

3.2. 1.2 

Removing  the  face  at  the  top  of  the  heap  lis* 

The  idea  is  to  take  out  the  face  at  the  tc,  it  ♦!>>-  hea 
list,  replacing  it  by  the  face  at  the  bottor-  "  tut  he  H 
list.  If  necessary,  the  internal  order  ,  „  ;e- 

establishea  by  comparing  pai  -vV.  ms. 

Thus,  the  tree  is  traversed  from  VitO  VO|<  .. .  ,/ards. 

Algorithm  REMHEAP  : 

R.l  Take  out  the  face  at  the  top  of  the  list:  I- 

FOUT=LHEAP ( 1 ) 

R.2  Place  the  face  stored  at  the  end  of  the  heap  list 
at  the  top: 

LHEAP(l)»LHEAP(lHEAP),and  lower 
■HEAP:  HHEAP=«HEAP-1. 

R.3  Set  the  position  of  the  father  IPFATH  in  the  heap 
list  to 
IPFATB=1 

R.4  Then  the  positions  of  the  two  sons  IPS0I1  and 
IPS0I2  in  the  heap  list  are  given  by:  IP- 

SOU  *2+IPFATH 
and  IPS0I2=IPS011+1. 

R.5  The  faces  associated  with  the  positions  of  father 
and  sons  are: 


IFSOIl 2 LHEAP ( XPS0I1 ) , 

IFS0I2*LHEAP ( IPS012) , 

IFF ATH=LHEAP ( IPFATH ) . 

R.6  Determine  which  son  needs  to  be  exchanged: 

If  RFACE(IFFATH)  <  RFACE( IFSOIl ) . 
RFACECIFS012)  :  set  IPEXCH=0 
If  RFACE(IFFATH)  >  RFACE(IFSOll)  > 
RFACE(IFSQI2)  :  set  IPBICH-IPS0I2 
If  RFACE(IFFATH)  >  RFACE(IFS0I2)  > 
RFACE(IFSOIl) :  set  IPEXCH=I?S0*1 
If  RFACE(IFSOIl)  >  RFACE(IFFATH)  > 
RFACE(IFS012) :  set  IPEXCB»IPS0I2 
If  RFACB(IFS0I2)  >  RFACE(IFFATH)  > 
RFACE(IFSOIl)  : 
set  IPEXCH»IPS0I1 

R.7  Unless  IPEICH=0,  exchange  father  and  son  posi¬ 
tions: 

R.7.1  interchange  the  faces  stored  in  LHEAP 

R.7. 2  set  IPFATH=IPEXCH 

R.7. 3  unless  2*IPFATH  >  IHEAP  (bottom  of  the 
list),  go  back  to  step  R.4. 

In  this  way,  the  face  with  the  smallest  associated  key 
will  again  remain  at  the  top  of  the  list  in  position  L- 
HEAP(l).  The  described  process  is  illustrated  in  Fig¬ 
ure  3.6,  where  the  successive  removal  of  the  small¬ 
est  element  (i  ’phabetically)  from  the  previously  con¬ 
structed  heap  list  is  shown. 


i. 


3  ...  6. 
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Figure  3.6:  Successive  Deletion  of  the 
Smallest  Item  from  the  Heap  List 


It  is  easy  to  prove  that  b^th  the  insertion  and 
the  deletion  of  a  face  into  the  heap  list  will  take 
On  ,j(IHEAP))  operations  [Ds.3,4]  on  the  average. 

I  Quad/Octrees  for  the  Points 

Quadtrees  and  Octrees  have  been  used  extensively  for 
2-D  and  3-D  gr  generators  [Go.l].  Their  main  role 
there  was  to  define  the  objects  to  be  gridded,  and 


not  to  provide  an  0(log(jV))  search  algorithm  for  ar¬ 
bitrary  point  distributions.  The  main  Hess  are  de¬ 
scribed  for  2-D  regions.  The  extension  co  3-D  regions 
io  immediate.  Define  an  array  LQUAD(1:7.HQUAD)  to 
store  the  points,  where  HQUAD  denotes  the  maximum 
number  of  quads  allowed.  For  each  quad  IQ,  store  in 
LQUAD(1:7,XQ)  the  following  information: 

LQUAD(  7,  IQ)  :  <0:  the  quad  is  full 

=  0  :  the  quad  is  empty 
>  0  :  the  number  of  points 
stored  in  the  quad 

Lqr.%D(  a, IQ)  :  >  0  :  the  quad  the  present 

quad  came  from 

LQUID(  6, IQ)  :  >  0  :  the  position  in  the 

quad  the  present 
quad  came  from 

LQUAD(l:4,IQ)  :  for  LQUAD(7,iq)  ;>  0  : 

the  points  stored  in  this  quad 


for  LQUAD(7,IQ)  <  0  : 
the  quads  into  which  the 
present  quad  was  subdivided 

At  most  four  points  are  atored  per  quad.  If  a  fifth 
point  falls  into  the  quad,  the  quad  is  subdivided  into 
four,  and  the  old  points  are  re-located  into  their  re¬ 
spective  quads.  Then  the  fifth  point  is  introduced  to 
the  new  quad  into  which  it  falls.  If  the  quad  is  full 
again,  the  subdivision  process  continues,  until  a  quad 
with  vacant  storage  space  is  found.  This  process  is 
illustrated  in  Figure  3.7.  The  newly  introduced  point 
E  fails  into  the  quad  IQ.  As  IQ  already  contains  the 
four  points  A,B,C  and  D,  the  quad  is  subdivided  into 
four.  Poin*a  A,B,C  and  D  arc  relocated  to  the  new 
quads,  and  point  E  is  added  to  the  new  quad  KQUAD+2. 
Figure  3.7  also  shows  the  entries  in  the  LQUAD-array, 
as  well  as  the  associated  tree-structure. 
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Figure  3.7:  Introduction  of  s  New  Point  into  a  full  Quad 
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In  order  to  find  points  that  lie  inside  a  search  region, 
the  quadtree  is  traversed  from  the  top  downwards. 
In  this  way,  those  quads  that  lie  outside  the  search 
region  are  eliminated  at  the  highest  possible  level. 
An  attractive  feature  of  quadtroes  and  octrees  is  that 
there  is  a  close  relationship  between  the  spatial  loca¬ 
tion  and  the  location  in  the  tree.  This  considerably 
reduces  the  number  of  operations  required  to  find  the 
quads  covering  a  desired  search- region.  It  is  not  dif¬ 
ficult  to  see  that  with  the  quadtree  or  the  octree  it 
takes  0(\om(N))  or  0(logg(JV))  operations  to  locate 
all  points  inside  a  search  region  or  to  find  the  point 
closest  to  a  given  point. 

3.2.3  N-trees  for  the  Face/Point  Search 

N-trees  are  linked  lists  that  are  often  used  ">  re¬ 
late  data  of  different  nature  and  number  of  items. 
For  unstructured  grids  one  could  relate  element,  face, 
side  and  point-data.  In  the  present  case,  a  storage 
scheme  is  sought  to  answer  the  question:  which  are 
the  faces  adjacent  to  a  given  point?  As  the  num¬ 
ber  o!  faces  surrounding  a  point  varies  from  point  to 
point,  but  usually  fluctuates  within  certain  bounds, 
the  following  scheme  appears  attractive.  Define  an 
array  LPOlKl:  IPOII)  over  the  points  and  another 
array  LFAPO(l-JtFSUP, IFAPO),  where  IPOII  denotes 
the  number  of  points,  MFSUP  the  average  number  of 


faces  surrounding  points  (+1),  and  HFAPO  the  maxi¬ 
mum  number  of  storage  locations.  Then  store  in: 


POII< IPOII)  :  the  place  IFAPO  in  LFAPO 

where  the  storage  of  the  faces 
surrounding  point  IPOII  starts. 


LFAPOfKFSUP, IFAPO)  : 


>  0  :  the  number  of 
stored  faces 
<  0  :  the  place  JFAPO 
in  LFAPO 

where  the  storage 
of  the  faces 
surrounding 
point  IPOII 
is  continued 
an  empty  location 
a  face  surrounding 
IPOII 


LFAPO(i:IFSUP-l, IFAPO)  :  =  0  : 

>0  : 


In  2-D  one  typically  has  two  faces  adjacent  to  a  point, 
and  HFSUP“3,  while  for  3-D  meshes  typical  values 
are  KFSUP«8-tO.  Once  this  storage  scheme  has  been 
set  up,  storing  and/or  finding  the  faces  surrounding 
points  is  readily  done.  The  process  of  adding  a  face  to 
the  linked  list  LPOIl/LFAPO  is  shown  diagramatically 
in  Figure  3.8. 
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Figure  3.8:  Iutrodc'^ioii  of  a  Now  Face  into  the  Linked  List 
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3.3  Additional  Techniques  to  Increase  Speed 

There  are  some  additional  techniques  that  can  be  used 
to  improve  the  performance  of  the  advancing  front 
grid  generator.  The  most  important  of  these  are: 

a)  Filtering:  Typically,  the  number  of  close  points 
and  faces  is  far  too  conservative,  i.e.  large.  As  an 
example,  consider  the  search  for  close  points:  there 
may  be  up  to  eight  points  inside  an  octant,  but  of 
these  only  one  m  ,y  be  close  to  the  face  to  be  taken 
out.  The  idea  is  to  filter  out  these  ‘distant’  faces  and 
points  in  order  to  avoid  extra  work  afterwards.  While 
the  search  operations  are  difficult  to  vectorize,  these 
filtering  operations  lend  themselves  to  vectorization 
in  a  straightforward  way,  leading  to  a  considerable 
overall  reduction  in  CPU  requirements. 

b)  Automatic  Reduction  of  Unused  Points:  As  the 
front  advances  into  the  domain  and  more  and  more 
tetrahedra  are  generated,  the  number  of  tree-levels 
increases.  This  automatically  implies  an  increase  in 
CPU-time,  as  more  steps  are  required  to  reach  the 
lower  levels  of  the  trees.  In  order  to  reduce  this  CPU- 
increase  as  much  as  possible,  all  trees  are  automatical¬ 
ly  restructured.  All  points  which  are  completely  sur¬ 
rounded  by  tetrahedra  are  eliminated  from  the  trees. 
This  procedure  has  proven  to  be  extremely  effective. 
It  reduces  the  asymptotic  complexity  of  the  grid  gen¬ 
erator  to  less  than  0(N  log  N).  In  fact,  in  most  prac¬ 
tical  cases  one  observes  a  linear  O(A')  asymptotic 
complexity,  as  CPU  is  traded  between  subroutine  cal- 
1  overheads  and  less  close  faces  on  average  for  large 
problems. 

c)  Global  h-refinement:  While  the  basic  advancing 
front  algorithm  is  a  scalar  algorithm,  h- refinement  can 
be  completely  vectorized.  Therefore,  the  grid  gen¬ 
eration  process  can  be  made  considerably  faster  by 
first  generating  a  <  oarser,  but  stretched  mesh,  and 
then  refining  globally  this  first  mesh  with  classic  h- 
refinement  [Ae.6,7].  Typical  speed-ups  achieved  by 
using  this  approach  are  1:6  to  1:7. 

3.4  Additional  Techniques  to  Enhance  Reliability 

The  advancing  front  algorithm  described  above  may 
still  fail  for  some  pathological  cases.  The  newcomer 


should  be  reminded  that  in  3-D,  even  the  slightest 
chance  of  something  going  astray  has  to  be  accounted 
for.  In  a  mesh  of  over  a  million  tetrahedra  (common 
even  for  Euler-runs),  any  slight  possibility  becomes 
a  reality.  The  following  techniques  have  been  found 
effective  in  enhaciug  the  reliability  of  advancing  front 
grid  generators  to  a  point  where  they  can  be  applied 
on  a  routine  basis  in  a  production  environment: 

a)  Avoidance  of  bau  elements  during  generation:  It  is 
important  not  to  allow  any  bad  elements  to  be  creat¬ 
ed  during  the  generation  process.  These  bad  elements 
can  cause  havoc  when  trying  to  introduce  further  ele¬ 
ments  at  a  later  stage.  Therefore,  if  a  well-shaped 
tetrahedron  can  not  be  introduced  for  the  current 
face,  the  face  is  skipped. 

b)  Sweep  and  Retry:  If  any  faces  where  new  ele¬ 
ments  could  not  be  introduced  remain  in  the  field, 
these  regions  are  enlarged  and  remeshed  again.  This 
’sweep  and  retry’  technique  has  proved  extremely  ro¬ 
bust  and  reliable.  It  has  also  made  smoothing  of 
meshes  possible:  if  elements  with  negative  or  smal- 
1  Jacobian s  appear  during  smoothing  (as  is  the  case 
with  most  spring-analogy  smoothers),  these  elements 
are  removed.  The  unmeshed  regions  of  space  are  then 
regridded.  By  being  able  to  smooth,  the  mesh  quality 
is  improved  substantially,  leading  to  better  results  in 
field  solvers. 

3.5  Some  Examples 

3.5.1  Multi- Element  Airfoil  Configuration:  Figure  3.9 
shows  a  multi-element  airfoil  case.  Figure  3.9a  gives 
the  boundary  information,  Figure  3.9b  the  back¬ 
ground  grid  used  and  Figure  3.9c  the  quadtree  ob¬ 
tained  for  the  background  grid.  Figures  3.9d,e  show 
the  constructed  mesh  before  smoothing,  as  well  as  the 
associated  quadtree.  This  grid  is  then  smoothed  using 
a  spring  system  analogy,  producing  the  grid  shown  in 
Figure  3.9f. 

Currently,  the  advancing  front  algorithm  constructs 
grids  at  a  rate  of  25,000  tetrahedra  per  minute  on 
the  CRAY-XMP  or  CRAY-2.  With  one  level  of  h- 
refinement,  the  rate  is  190, 000  to  200,000  tetrahedra 
per  minute.  This  rate  is  essentially  independent  of 
grid-size,  but  may  decrease  for  very  small  grids. 


Figure  3.9;  Multi- Element  Airfoil  Configuration 
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4.  ADAPTIVE  REFINEMENT 

Beside*  their  ability  to  discretize  accurately  complex 
geometric*,  a  second  very  attractive  feature  of  un¬ 
structured  grids  is  the  ease  with  which  adaptive  re¬ 
finement  can  be  incorporated  into  them.  The  addition 
of  further  degree*  of  freedom  doe*  not  destroy  any  pre¬ 
vious  structure.  Thus,  the  flow  solver  requires  no  fur¬ 
ther  modification  when  operating  on  an  adapted  grid. 
For  many  practical  problems,  particularly  those  with 
travelling  shocks  or  flapping  wakes,  the  regions  that 
need  to  be  refined  are  extremely  small  as  compared 
to  the  overall  domain.  Therefore,  the  savings  in  stor¬ 
age  and  CPU-requitements  typically  range  between 
10-100,  as  compared  to  an  overall  fine  mesh  [Atn.l- 
4,Ae. 1-12, Ar. 1-5].  Experience  indicates  that  for  prob¬ 
lems  falling  in  this  class,  adaptive  refinement  makes 
the  difference  between  being  or  not  being  able  to  run 
the  problem  to  an  acceptable  accuracy  in  a  reason¬ 
able  time  [Ae.S- 10] .  Without  it,  one  would  be  forced 
to  use  much  coarser  grids,  with  lower  accuracy,  for 
the  same  expense. 

On  the  other  hand,  one  should  not  always  cxpec- 
t  such  big  gains  from  adaptive  refinement.  As  an 
example,  consider  the  repetitive  simulation  of  iuvis- 
cid  subsonic  flow  past  airfoils.  For  this  application,  a 
well-chosen  O-mesh  already  gives  a  near-optimal  dis¬ 
cretization.  Therefore,  except  perhaps  at  leading  and 
trailing  edges  and  possible  shock- regions,  adaptation 
will  not  give  dramatic  gains  in  performance. 

The  hidden  advantage  of  adaptive  refinement  is  that 
it  frees  the  user  from  having  to  waste  time  choos¬ 
ing  a  good  initial  grid.  With  adaptation,  any  ini¬ 
tial  grid  will  be  transformed  into  a  near-optimal  dis¬ 
cretization.  Thus,  adaptation  adds  a  new  dimension 
of  user-friendliness  to  the  CFD  process  that  was  not 
there  previously. 

Any  adaptive  refinement  scheme  is  composed  of  three 
main  ingredients.  These  are 

1)  an  optimal-mesh  criterion, 

2)  an  error  indicator,  and 

3)  a  method  to  refine  and  coarsen  the  mesh. 

They  give  answers  to  the  questions 

1)  how  should  the  optimal  mesh  be  ? 

2)  where  is  refineraent/coarseuing  required  ? 

3)  how  should  the  refinement /coarsening  be  accom¬ 
plished  ? 

The  topic  of  adaptation  being  now  a  decade  old,  it 
is  not  surprising  that  a  variety  of  answers  have  been 
proposed  by  several  authors  for  each  of  these  ques¬ 
tions.  In  the  following,  the  most  successful  ones  are 
discussed  in  more  depth. 

4.1  Qntimal  Mesh  Criteria 

The  most  common  optimal  mesh  criteria  employed 
are: 

a)  Eauidistribution  of  Error:  The  aim  is  to  attain  a 
grid  in  which  the  error  is  uniformly  distributed  in  »- 
pace.  One  cam  show  ihat  such  a  mesb  has  the  smallest 


numbers  of  degrees  of  freedom  (i.c.  the  smallest  num¬ 
ber  of  elements)  for  the  general  aim: 

— »  min  Vx  €  0  .  (4.1) 

Conceptually,  one  can  derive  this  criterion  from  the 
observation  that  the  error  will  have  the  irregular  dis¬ 
tribution  for  the  fitst  mesh  shown  in  Figure  4.1a.  if 
the  number  of  degrees  of  freedom  is  kept  the  same, 
tiie  distribution  of  element  size  and  shape  is  all  that 
may  be  varied.  After  repositioning  of  points,  the  er¬ 
ror  distribution  in  space  will  become  more  regular, 
as  shown  in  Figure  4.1b.  One  can  also  see  that  the 
general  aim  stated  in  Eqn.(4.1)  will  be  achieved  when 
the  error  is  constant  in  the  domain. 


a>  (lofore  Apaptalion 


o.  ror 


l_l — | — | — h+H-H-W — ! — I — ) 


P>  -Nf to,  Adaptation 


Fituye  4,1:  Optimal  Mesh  Criterion 

b)  Local  Absolute  Error  Tolerances:  In  many  practi¬ 
cal  applications,  the  required  error  tolerances  may  not 
be  the  same  at  all  locations.  Moreover,  instead  ot '  '- 
ing  the  general  minimization  stated  in  Eqn.(4.1),  oju 
may  desire  to  enforce  absolute  local  bounds  in  certain 
regions  of  the  domain: 

(h  <  c,  V*  €  0,u»  .  (4.2) 

Meeh  refinement  or  coarsening  would  then  take  place 
if  the  local  error  indicator  exceeds  or  falls  below  given 
refinement  or  coarsening  tolerance*: 

(h  >  e,  =*■  refine  ,  t*  <  cc  =>  coarsen. 

4.2  Error  Indicators/Estimators 

Consider  the  task  of  trying  to  determine  if  the  solu¬ 
tion  obtained  on  the  present  mesh  is  accurate.  In- 
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tuitively,  a  number  of  criteria  immediately  come  to 
mind;  variations  of  key-variables  within  elements, 
entropy-levels,  higher-order  derivatives  of  the  solu¬ 
tions,  etc.  AH  of  them  make  the  fundamental  as¬ 
sumption  that  the  solution  on  the  present  mesh  is 
already  in  some  form  ‘close’  to  the  exact  solution. 
This  assumption  is  reasonable  for  parabolic  and  el¬ 
liptic  problems,  where,  due  to  global  minimization 
principles,  local  deficiencies  in  the  mesh  have  only  a 
local  elfect.  For  hyperbolic  problems,  the  assumption 
ss  ti  may  be  completely  erroneous.  Consider  an 
airfoil  at  high  angle  of  attack.  A  coarse  initial  mesh 
may  completely  miss  local  separation  bubbles  at  the 
leading  edge  that  lead  to  massive  separation  in  the 
back  portion  of  the  airfoil.  Thus,  any  error  indicator 
presently  in  use  (and  f  really  mean  any)  would  mis- 
s  these  features,  performing  adaptation  at  the  wrong 
places.  On  the  other  hand,  the  assumption  uh  as  u  is 
a  very  reasonable  one  for  most  initial  grids.  As  a  mat¬ 
ter  of  fact,  it  is  not  so  difficult  to  attain,  particularly 
if  a  similar  problem  has  been  solved  before. 

4.2.1  Popular  Error  Indicators: 

The  most  popular  error  indicators  presently  used  in 
production  codes  may  be  grouped  into  the  following 
categories: 

1.1  Jumna  in  Indicator  Variables:  The  simplest  error 
indicator  is  obtained  by  simply  looking  at  the  jump  of 
some  indicator  variable  like  the  Mach-numbcr,  densi¬ 
ty,  or  entropy  within  an  element.  The  underlying  as¬ 
sumption  is  that  in  those  regions  where  these  jumps 
ate  large,  more  elements  are  re  mired.  This  assump¬ 
tions  fails  at  shocks,  where  tae  jump  will  stay  the 
same  no  matter  now  fine  the  mesh  is  made.  Never¬ 
theless,  error  indicators  of  this  form  have  been  used  in 
industrial  applications  with  success  (Ae.3,Ae.l  1,12). 

1.2  Interpolation  Theory:  Making  the  assumption 
that  the  solution  is  smooth,  one  may  approximate 
the  error  in  each  elements  by  a  derivative  one  order 
higher  than  the  element  shape  function.  For  1-D,  this 
would  result  in  an  error  indicator  at  the  element  level 
of  the  form 

<J'  =  C,hPS  1  (43) 

where  the  p  —  th  derivative  is  obtained  by  some  re¬ 
covery  procedure,  and  for  linear  elements  p  =  2.  The 
total  error  in  the  computational  domain  is  the  given 
by 

‘“"’[/‘’s M*  (4,) 

1.3  Comparison  of  Derivatives:  Again  making  the  as¬ 
sumption  that  the  solution  is  smooth,  one  may  com¬ 
pare  significant  derivatives  using  schemes  of  different 
order.  As  an  example,  consider  the  following  two  ap¬ 
proximations  to  a  second  derivative: 


=  ^2  (w.'-i  -  2t»j  (d.5a) 

».«!«  =  —  4-  l«ttj+i  —  Uj+j) 

(4.5b) 

The  assumption  of  smoothness  in  u  would  allow  a 
good  estimate  of  the  error  in  the  second  derivatives 
from  the  difference  of  these  two  expressions.  For  un¬ 
structured  grids,  oue  may  recover  these  derivatives 
with  reconstruction  procedures. 

1.4  Residuals  of  PDEs  on  Adjacent  Grids:  Assume 
we  have  a  node-centered  scheme  to  discretize  the 
PDEs  at  hand.  At  steady  state,  the  residuals  at  the 
nodes  will  vanish.  On  the  other  hand,  if  the  residu¬ 
als  are  evaluated  at  the  element  level,  non- vanishing 
residuals  are  observed  in  the  regions  that  require  fur¬ 
ther  refinement.  This  error  indicator  has  been  used 
extensively  in  France  [Ac.  4],  Another  possibility  is 
to  check  locally  the  effect  of  higher  order  shape  func¬ 
tions  introduced  at  the  clement  level  or  at  element 
boundaries.  These  so-called  p-refineinent  indicators 
have  been  used  extensively  for  structural  FEM  appli¬ 
cations  [Ae. 13,14]. 

All  of  these  error  indicators  have  been  used  in  prac¬ 
tice  to  guide  mesh  adaptation  procedures.  They  all 
work  for  their  respective  area  of  application.  It  seems 
that  the  derivation  of  a  proper  error  indicator  is  not 
a  difficult  task.  The  analyst  usually  knowns  how  to 
discern  a  good  solution  from  a  bad  one.  An  error 
indicator  built  on  this  knowledge  has  to  work  ! 

4.2.2  Transient  Compressible  Flows 

Transient  Compressible  Flows,  with  their  travelling 
shocks  of  widely  different  strengths  which  need  adap¬ 
tation  every  5-10  timesteps  require  more  refined  error 
indicators.  Design  criteria  for  these  error  indicators 

arc: 

a)  The  error  indicator  should  be  fast. 

b)  The  error  indicator  should  be  dimensionless,  so 
that  several  ‘key  variables’  can  be  monitored  at 
the  same  time. 

c)  The  error  indicator  should  be  bounded,  so  that 
no  further  user  intervention  becomes  necessary 
as  the  solution  evolves. 

d)  The  error  indicator  should  not  only  mark  the  re¬ 
gions  with  strong  shocks  to  be  refined,  but  al¬ 
so  weak  shocks,  contact  discontinuities  and  other 
‘weak  features’  in  the  flow. 

All  of  the  popular  error  indicators  described  above  are 
not  dimensionless.  This  implies  that  strong  shocks 
produce  large  error  indicators,  whereas  weak  shocks 
or  contact  discontinuities  produce  small  ones.  Thus, 
in  the  end,  only  the  strong  shocks  would  be  refined, 
losing  the  weak  features  of  the  flow.  An  error  indica¬ 
tor  that  meets  the  design  criteria  a)-d)  was  proposed 


in  (Ae.C).  In  general  terms,  it  is  of  the  form 


—  h2  |sccon<<  deritia(tt;ca| 

h  |/irs<  <fertt>«i<M'ea|  4-  t  |m«an  tntftie| 

(-1  6) 

By  dividing  the  second  derivatives  by  the  absolute 
value  of  the  first  derivatives  the  error  indicator  be¬ 
comes  bounded,  dimeusionless,  and  the  'eating  up* 
effect  of  strong  shocks  is  avoided.  The  terms  follow¬ 
ing  t  are  added  as  a  'noise*  filter  in  order  not  to  refine 
'wiggles’  or  'ripples’  which  may  appear  due  to  loss  of 
thonotonicity.  The  value  for  e  thus  depends  on  tile 
algorithm  chosen  to  solve  the  1*1) Ks  describing  the 
physical  process  at  hand.  The  multidimensional  form 
of  this  error  indicator  is  given  by 


DialW+t-f/il  +  llfi-W-il  (4.96) 

(4.9c) 

the  error  indicator  on  the  present  (old)  grid  E0>lt  is 
given  by: 


Ef“  = 


(4.10) 


Tire  reduction  of  the  current  clement  awe  hoM  by  a 
fraction  {  to  hu,w  —  £  •  h0,li  will  yield  a  new  error 
indicator  of  the  form 


Li  - 


oH2 


(4.11) 
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Given  the  desired'  error  indicator  value  E'""“  for  the 
improved  mesh,  the  reduction  factor  £  is  given  by: 


i  Ekl!(/rti^ip^i +<(iAf.fn^i)j  dft)»‘ 

(4.7) 

where  N1  denotes  the  shape-function  of  node  I.  This 
error  indicator  has  performed  very  well  in  2-D  over 
tho  years  [Ae.6-10,Ar.2-5],  However,  when  first  used 
in  3-Q,  it  proved  unreliable.  The  source  for  this  seem¬ 
ingly  inconsistent  behaviour  was  found  to  stem  from 
the  large  local  variations  in  element  size,  shape,  as 
well  as  number  of  element  surrounding  a  point  en¬ 
countered  in  typical  3-D  unstructured  grids.  These 
will  produce  large  variations  of  the  second  term  in  the 
denominator  which  are  not  based  on  physics,  but  on 
the  mesh  structure  itself.  The  solution  was  to  modify 
this  error  indicator  as  follows: 


E‘  = 


1 


Ei.j  (fa  NfNUn  •  tO)» 

£*,»(/o  \n{\  |  N~'Uj\<my  +  €MMihT*\U'\ ' 


(48) 

where  A/Af/  is  the  lumped  mass-matrix  at  point  /, 
and  hi  the  average  element  length  at  point  /.  This  er¬ 
ror  indicator  has  proven  to  be  remarkably  insensitive 
to  local  variations  in  element  site  and  shape,  while 
still  yielding  the  correct  indicator  values  for  physi¬ 
cal  phenomena  of  interest.  This  good  performance 
is  attributed  to  the  smoothing  effects  of  two  averag¬ 
ing  operations  working  simultaneously:  the  lumped 
mass-matrix  and  the  point-lenghU. 


4  2.2.1  Determination  of  Element  Sizes 

If  all  that  is  required  is  a  thresholding  for  refinement 
or  coarsening,  the  error  indicator  given  by  Eqn.(4.8)  is 
sufficient.  On  the  other  hand,  one  may  wish  to  obtain 
a  more  precise  estimation  of  the  required  element  sire 
to  meet  a  certain  tolerai-.ee  (e.g.  to  use  within  an 
adaptive  remeshing  context).  In  this  case,  a  more 
precise  analysis  is  required.  Defining  the  derivatives 
according  to  order  as: 


0?  =  (|tt+i|  +  2  -  \Vi\  +  \Vi-i\)  (4.9a) 


E""*  1 
2 

(4.12) 

Observe  that  for  a  smooth  solution  with  Dx  <  D°, 
this  results  in  f  =  (J5n*u7JP‘>,<,)°  \  consistent  with 
the  second  order  accuracy  of  linear  elements.  Close 
to  discontinuities  Dx  >  D°,  and  one  obtains 

£  =  Bnt'“ / EeU ,  consistent  with  the  first  order  er¬ 
ror  obtained  in  these  regions. 

This  error  indicator  can  be  generalized  to  multidimen¬ 
sional  situations  by  defining  the  following  tensors: 

(&)i,  ***<*[  \N$,\\N$\\V,\dCl  ,  (4.13) 

Jn 

(»%  =  **  I  \N^\\Nf,Uj\dtt  , 

Jn 

(D%  =  hJ|  f  N^NjtKl  Uj  |  .  (4.11) 

Jn 

which  yield  an  error  matrix  E  of  the  form: 


f'zT  Eyf  Elf  "I 

P3 

o 

o 

E  =  i 

Fjty  Ey y  1  —  X-  < 

0  E>2  o 

,  tf<  Eyt  E,,  J 

[  o  0  t:3  a  J 

(415) 

The  principal  eigenvalues  of  this  matrix  arc  then  used 
to  obtain  reduction  parameters  £j>j>  in  the  three  as¬ 
sociated  eigenvector  directions.  Due  to  the  symmetry 
of  E,  this  is  an  orthogonal  system  of  eigenvectors  that 
defines  a  local  coordinate  system. 

4. 2.2.2  Smoothing  of  Element  Size  and  Stretchings 

As  mentioned  above,  it  is  very  important  in  the  con¬ 
text  of  transient  problems  to  generate  grids  which 
do  not  exhibit  minimum  element  sizes  that  are  much 
smaller  than  the  prescribed  minimum  element  site. 
Practical  calculations  indicate  that  the  grids  pro¬ 
duced  by  simply  taking  the  described  error  indica- 
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tor  and  the  resulting  distribution  of  element  sites, 
stretchings  and  stretching  directions  did  not  meet  this 
requirement,  fn  other  words,  they  were  too  irregular. 
Far  superior  grids  were  obtained  by  smoothing  and 
limiting  the  initial  distributions  obtained  for  element 
sizes,  stretchings  and  stretching  directions. 


Smoothing  of  Element  lengths:  The  element  length 
h  is  a  scalar  quantity  Within  each  smoothing  pass 
over  the  mesh,  the  following  operations  are  per¬ 
formed: 


Ss.l  Tor  each  clcmont  cl-  take  the  average  clement 
length  h*f*  over  its  nodes; 


where  i  =  I 
ment  t.l. 


.  nnotl 

V  -  E  *•>  ■ 


(4.1(5) 
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nnotl  represent  the  mdcs  of  ele- 


Se.2  For  each  point  i:  form  the  average  element  length 
/?v*  over  its  surrounding  elements: 


lfvt  - 
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(4.17) 


where  el  ~  1 . nauef  represent  the  elements 

surrounding  node  i. 

Ss.3  For  each  point  i:  obtain  the  now  element  length 
hi  from 

hi  -  min{lfv\h()  .  (4.18) 

Note  that  a  'point  average’  is  taken,  and  not  ait 
‘area  weighted  average’.  This  is  important,  as  it 
limits  the  infuence  of  large  element*  over  smal- 
l  ones.  One  can  also  observe  that  the  element 
length  is  never  allowed  to  increase  at  any  given 
point. 


Smoothing  of. Stretchings  and  Stretching  Directions: 
The  stretching  direction  a  is  a  vector  quantity.  There¬ 
fore,  problems  may  arise  when  trying  to  smooth 
stretching  directions.  A  typical  case  is  shown  in  Fig* 
nre  4.2.  Suppose  an  average  stretching  vector  in  the 
element  is  desired.  In  order  to  accentuate  the  stretch¬ 
ing  direction  with  the  maximum  stretching,  we  mul¬ 
tiply  each  stretching  vector  with  its  corresponding 
stretching  factor.  Simple  averaging  would  then  yield 
the  completely  erroneous  element  stretching  direction 
denoted  by  su  in  Figure  4.2.  In  order  to  avoid  this 
problem,  the  following  algorithm  is  employed: 

Sv.l  For  each  element  el:  compute  the  maximum 
stetching  s™‘J*  encountered  at  the  nodes: 


•T*  -  ■  j  •  (4.19) 

41  « ~  ,  nnoel  ' 

Sv.2  For  each  element  ei.  form  the  average  element 
stretching  s*tv  as 

j  nnoel 
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For  the  example  shown  in  Figure  4  2,  this  yields 


the  vector  denoted  ae  a 

Sv.3  For  each  point i:  compute  the  maximum  stetch¬ 
ing  si’1"*  over  its  surrounding  elements: 


Amdf 


max 

cl  vt  l,nsud 
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(4.21) 


Sv.-t  For  each  point  »:  form  the  average  element 
Rtrctching  s,  over  its  surrounding  elements: 

J  ll*U«l 

=  -^i  £  9,1  *»>(C*  •**!)  ■ 

As  before,  a  point  average  proves  more  effective  than 
an  area  weighted  average,  as  it  limits  the  infuence  of 
large  elements  over  small  ones.  For  typical  runs,  three 
to  four  smoothing  passes  over  the  mesh  are  petfomed 
for  the  element  lengths  and  stretchings. 
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4.3  Refinement  Strategies 

Reside#  the  aim  of  refinement  and  tin*  error  indi- 
•'■'itor/fxtimator,  the  third  ingredient  of  any  adap¬ 
tive  refinement  method  is  the  ■  slinement  strategy,  i.e. 
how  to  retine  a  given  mesh.  Three  different  families 
of  refinement  strategies  have  been  proposed  to  date: 

S.l  Mesh  Movement  or  Repositioning  <r-nu;tl>odaV 
The  aim  is  to  reposition  the  points  in  the  field  in 
order  to  obtain  i  better  discretimor  for  the  prob¬ 
lem  at  hat/  The  regions  where  more  elements  are 
required  tend  to  draw  points  and  elements  from  re¬ 
gions  where  a  coarser  mesh  can  be  tolerated.  Two 
basic  approaches  have  been  used  to  date: 

a)  Spring  Systems,  whereby  the  mesh  is  viewed  as  a 
system  of  springs  whose  stiffness  is  proportional 
to  the  error  indict  tor,  and 

b)  The  Moving  Finite  Clement  Method,  where  the 
position  of  poin  is  viewed  as  further  unknowns 
in  a  general  functioned  to  be  minimized. 

Mesh  movement  schemes  art  relatively  simple  to  code, 
as  the  mesh  topology  is  not  allowed  to  change.  On 
the  other  hand,  they  are  not  flexible  and  general  e~ 
dough  for  complex  production  runs  which  may  have 
many  shocks.  They  have  been  mectic  red  here  for 
completeness,  and  are  not  recommended  lor  practical 
applications. 


5.2  Mesh. Enrichment  (h/p-methods):  In  this  case, 
degrees  of  freedom  ate  added  or  taken  from  a  mesh 
according  to  some  rule,  One  may  either  split  elements 
into  new  ones  (h-reftnemcnt);  or  add  further  degrees 
of  freedom  with  hierarchical  shape-functions  (Fig¬ 
ure  4.3).  The  same  may  be  accomplished  with  the  ad¬ 
dition  of  higher  order  shape- functions  (p-refmement), 
again  cither  conventional  oner  or  hierarchical  ones. 
For  elliptic  systems  of  PDEs,  the  combination  of  h- 
and  p-rcfincmcnt  leads  to  exponential  convergence 
rates  (Ac.  14]. 

5.3  Ucm<»biim.l in-mcthods}:  Finally,  one  may  use  an 
advanced  unstructured  grid  generator  in  combination 
with  an  error  indicator  to  remesh  the  computational 
domain  either  globally  or  locally,  in  order  to  produce 
a  more  suitable  discretization. 

4.3.1  ftansientComorcssiblcFlows 

As  before,  if  we  consider  transient  compressible  flows 
with  travelling  shocks  of  widely  different  strengths 
which  require  adaptation  every  5-10  timesteps,  we  are 
faced  with  more  constraints  than  the  usual  steady- 
state  flow  application.  Design  criteria  for  refinement 
strategies  for  these  types  of  applications  are. 
a)  'l'lie  method  should  be  conservative,  i.e.  a  mesh 
change  should  not  result  in  the  production  or  loos 
of  mass,  momentum  or  enetgy. 
l>)  The  method  should  have  a  minimal  amount  of 
numerical  dissipation,  as  many  mesh  changes  are 
required  during  the  course  of  one  simulation, 

c)  The  method  should  not  produce  elements  that 
are  too  small,  us  this  would  reduce  too  severely 
the  allowable  timestep  of  the  explicit  flow  solvers 
employed. 


d)  The  method  should  be  fast.  In  particular,  it 
should  lend  itself  to  a  high  degree  of  parallelism. 

e)  The  method  should  not  involve  a  major  storage 
overhead. 

These  criteria  severely  limit  the  Add  of  applicable  a- 
trategies.  From  the  results  repotted  in  the  literature, 
one  can  observe  that  only  the  simplest  and  fastest 
of  all  lefinciiieiit  strategies,  i.e.  simple  h-refinemnnt 
with  only  one  level  of  refinenient/coarsening  per  mesh 
change,  has  .iver  made  it  into  a  production  environ¬ 
ment.  The  reasons  are  obvious: 
h.l)  Conservation  presents  no  problem  for  h- 
tcfincmcnt. 

h.2)  No  interpolations  other  than  the  onus  natural¬ 
ly  given  by  the  element  shape-functions  are  re¬ 
quired.  Therefore,  no  numerical  diffusion  is  in¬ 
troduced  by  the  adaptive  refinement  procedure, 
This  is  contrast  to  adaptive  remeshi-g,  where 
the  grids  before  and  after  a  mesh  change  may 
not  have  the  same  points  in  common.  The  re¬ 
quired  interpolations  of  the  unknowns  will  result 
in  an  increased  amount  of  numerical  diffusion 
(sec  [Ar.2]). 

h.3)  H- refinement  is  very  well  suited  to  vector  and 
parallel  processors.  This  is  of  particular  im¬ 
portance  in  the  present  context,  where  a  mesh 
change  is  performed  every  5-10  timesteps,  and 
a  large  percentage  of  mesh  points  is  affected  in 
each  mesh  chagc. 

h.4)  H-refiiienieiit  is  more  robust  than  remesh  mg 
Particularly  in  3-1).  the  amount  of  things  that 
can  go  wrong  seems  to  be  much  less  than  when 
remeshing. 

I  have  tried  other  methods  for  this  class  of  problems, 
like  remeshing,  Many  problems  can  be  solved  success- 
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fully  by  them,  but  experience  indicates  that  one  can 
never  be  sure.  Particularly  if  numerical  diffusion  due 
to  reinterpolation  is  present,  contact  discontinuities 
may  simply  disappear,  triple  points  may  become  dis¬ 
torted,  and  jets  may  spread  out  beyond  recognition. 

4.4  H-refinement  with  Tetrahedra 

Because  of  its  importance  in  practical  calculations, 
and  as  a  tutorial  example  of  what  is  typically  re¬ 
quired  for  an  h-refinement  strategy,  the  h-refinement 
with  tetrahedra  is  described  in  more  detail.  A9  stat¬ 
ed  above,  the  number  of  refinement/coarsening  levels 
per  mesh  change  is  limited  to  one.  Moreover,  refine¬ 
ment  of  a  tetrahedron  is  only  allowed  into  two  (along 
a  side),  four  (along  a  face)  or  eight  new  tetrahedra. 
These  cases  are  denoted  as  1:2,  1:4  and  1:8  respec¬ 
tively.  At  the  same  time,  a  1:2  or  1:4  tetrahedron 
ca:  only  be  refined  further  to  a  1:4  tetrahedron,  or 
by  first  going  back  to  a  1:8  tetrahedron  with  subse¬ 
quent  further  refinement  of  the  8  sub-elements.  We 
call  these  the  2:4,  2:8+  and  4:8+  refinement  cases. 
The  refinement  cases  are  summarised  in  Figure  4.4. 
This  restrictive  set  of  refinement  rules  is  necessary  to 
avoid  the  appearance  of  ill-deformed  elements.  Al  the 
same  time,  it  considerably  simplifies  the  refinemen- 
t/  coarsening  logic.  An  interesting  phenomenon  that 
does  not  appear  in  2-D  is  the  apparently  free  <•’  jice 
of  the  inner  diagonal  for  the  1 :8  refinement  case.  As 
shown  in  Figure  4.5,  one  can  place  the  inner  four  el¬ 
ements  around  the  inner  diagonals  5-10,  6-8,  or  7-9. 
In  the  present  case,  the  shortest  inner  diagonal  was 
chosen.  This  choice  produces  the  smallest  amount  of 
distorted  tetrahedra  in  the  refined  grid.  When  coars¬ 
ening,  again  only  a  limited  number  of  cases  that  are 
compatible  with  the  refinement  is  allowed.  Thus,  the 


coarsening  cases  become  8:4,  8:2,  8:1,  4:2,  4:1,  2:1. 
These  coarsening  cases  are  summarized  in  Figure  4  .6. 
When  constructing  the  algorithm  to  refine  or  coarsen 
the  grid  one  faces  the  usual  decision  of  speed  versus 
storage.  The  more  information  from  the  previous  grid 
in  stored,  the  faster  the  new  grid  may  be  construct¬ 
ed.  In  the  present  case,  this  was  accomplished  by  a 
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a)  From  Sides  to  Points 


b)  From  Points  to  Sides 


Figure  4.6:  Algorithm  to  Screen  for  Admissible  Refinement  Cases 


modified  tree-structure  which  requires  twelve  integer 
locations  per  element  in  order  to  identify  the  ‘paren- 
t’  and  'son'  elements  of  any  element,  as  well  as  the 
element  type. 

The  first  seven  integers  store  the  new  elements  (‘son¬ 
s’)  of  an  element  that  has  been  subdivided  into  eight 
(1:8).  For  the  1:4  and  1:2  cases,  the  sous  are  also 
stored  in  this  allocated  space,  and  the  remaining  in¬ 
teger  locations  are  set  to  zero. 

In  the  eigth  integer,  the  element  from  which  the 
present  element  originated  (the  ‘parent’  element)  is 
stored. 

The  ninth  integer  denotes  the  position  number  in  the 
parent  clement  from  which  this  element  came. 

The  tenth  integer  denotes  the  element  type.  One  can 
either  have  parents  or  sons  of  1:8,  1:4  or  1:2  tetra- 
hedra.  These  are  marked  by  a  positive  value  of  the 
element  type  for  the  parents,  and  a  negative  value  for 
the  sons.  Thus,  for  example,  the  son  of  a  1:8  element 
would  be  marked  as  -8. 

Finally,  in  the  eleventh  and  twelveth  integer  location, 
the  local  and  global  refinement  levels  are  remembered. 

These  twelve  integer  locations  per  element  are  suf¬ 
ficient  to  construct  further  refinements  or  to  recon¬ 
struct  the  original  grid.  It  is  clear  that  in  these  twelve 
integers  a  certain  degree  of  redundancy  is  present.  For 
example,  the  information  stored  in  the  10th  integer 
could  be  recovered  from  the  data  stored  in  locations 
1:8  and  11:12.  However,  this  would  require  a  number 
of  non-vcctorizable  loops  with  many  IF-testa.  There¬ 
fore,  it  was  decided  to  store  this  value  at  the  time  of 
creation  of  new  elements  instead  of  recomputing  it  at 
a  later  time.  Similarly,  the  1 1th  integer  can  be  recov¬ 
ered  from  the  information  stored  in  locations  1:8  and 
12.  As  is  the  case  with  the  10th  integer,  storage  was 
traded  for  CPU-time. 

4.4.1  Algorithmic.  Implementation 

Having  outlined  the  basic  refineiuent/coarsening  s- 
trategy,  its  algorithmic  implementation  can  now  be 


described  in  more  depth.  One  complete  grid  change 
requires  algorithmically  the  following  five  steps: 

1)  Construction  of  the  missing  grid  information 
needed  for  a  mesh  change  (basically  the  sides  of 
the  mesh  and  the  sides  adjoining  each  element); 

2)  Identification  of  the  elements  to  be  refined; 

3)  Identification  of  the  elements  to  be  deleted; 

4)  Refinement  of  the  grid  where  needed; 

5)  Coarsening  of  the  grid  where  needed. 

4  4.1  1  Construction  of  Missing  Grid  Information 

The  missing  information  consists  of  the  sides  of  the 
mesh  and  the  sides  belonging  to  each  element.  The 
sides  are  dynamically  stored  in  two  arrays,  one  con¬ 
taining  the  two  points  each  side  connects  and  the  oth¬ 
er  one  (a  pointer-array)  containing  the  lowest  side- 
number  reaching  out  of  a  point.  The  formation  of 
these  two  arrays  is  accomplished  in  three  main  loop- 
s  over  the  elements,  which  are  partially  vectorizable. 
After  having  formed  these  two  side-arrays,  a  further 
loop  over  the  elements  is  performed,  identifying  which 
sides  belong  to  each  element. 

4. 4. 1.2  Identification  of  Elements  to  be  Refined 

The  aim  of  this  sub-step  is  to  determine  on  which 
sides  further  gridpoints  need  to  be  introduced,  so  that 
the  resulting  refinement  patterns  on  an  element-level 
belong  to  the  allowed  cases  listed  above,  thus  produc¬ 
ing  a  compatible,  valid  new  mesh.  Five  main  steps 
are  necessary  to  achieve  this  goal: 

a)  Mark  elements  that  require  refinement; 

b)  Add  protective  layers  of  elements  to  be  refined; 

c)  Avoid  elements  that  become  too  small,  or  that 
have  been  refined  too  often; 

d)  Obtain  preliminary  list  of  sides  where  new  points 
will  be  introduced; 

e)  Add  further  sides  to  this  list  un*..  an  admissible 
refinement  pattern  is  achieved. 

The  first  three  of  these  steps  are  obvious.  The  last 
two  are  explained  in  more  detail. 


8* '<.3 


fc  - 


ft-K* 


d)  Obtain  preliminary  list  of  sides  for  new  points 
Given  the  aidc/element  information  obtained  in  sub¬ 
step  4.4. 1.1,  one  can  determine  a  first  aet  of  sides  on 
which  new  gridpointa  need  to  be  introduced.  This  set 
of  sides  is  still  preliminary,  as  only  certain  types  of 
refinement  are  allowed. 

e)  Add  farther  sides  to  achive  admissible  refinement 
The  list  of  sides  marked  for  the  introduction  of  new 
points  is  still  preliminary  at  this  point.  In  most  cases, 
it  will  not  lead  to  an  admissible  refinement  pattern 
to  construct  a  new  mesh.  Therefore,  further  sides 


are  marked  for  the  introduction  of  new  points  until 
an  admissible  refinement  pattern  is  reached.  This  is 
accomplished  by  looping  several  times  over  the  ele¬ 
ments,  checking  on  an  element  level  whether  the  set 
of  sides  marked  can  lead  to  an  admissible  new  set  of 
sub-elements.  The  algorithm  used  is  based  on  the  ob¬ 
servation  that  the  admissible  cases  are  based  on  the 
introduction  of  new  points  along  one  side  (1:2),  three 
contiguous  sides  (1:4),  or  six  contiguous  sides  (1:8). 
These  admissible  cases  can  be  obtained  from  the  fol- 
wing  element-by-element  algorithm  (see  Figure  4.7): 


c 

c  - this  sub  assembles  the  rhs -contributions  at  points  (rhspo) 

c  from  the  element -rhs  (rhsel) 

c 

c  we  assume  that:  lconrv  contains  the  mesh  connectivity 

c  rhsel;  contains  the  element-rhsides 

c 


This  loop  will  not  vectorize,  as  rhspo  in  loop  3000  may  be  accessed 
several  times 

c 

c  - set  rhspo=0.0 

c 

call  rfilvc(npoin, rhspo, 0.0) 

c 

c  - loop  over  the  nodes 

c 

do  2000  inode=l,  nnode 
c 

c  - loop  over  the  elements 

c 

do  3000  ielem=l,nelem 
ipoin=lcoun(ielem,  inode) 

rhspo(  ipoin)  =«rhspo ( ipoin)  + rhsel ( ie Jem,  inode) 

3000  continue 
c 

2000  continue 

By  renumbering  the  elements,  accesing  the  same  point  within  each  group 
of  elements  can  be  avoided;  thus,  one  can  force  vector  is  at  ior.  of  loop  3000 
c 

c  - set  rhspo=G.O 

c 

call  rfilvc(npoin, rhspo, 0.0) 

c 

c  - outer  loop  over  the  groups  of  elements 

c 

ielel=0 

c 

do  1000  igrou*l,ngrou 


c 

c  - starting  and  ending  elements  of  this  group 

c 

iele0=ielel+l 
ieiel=lgrou(  igrou) 

c 

c  - loop  over  the  nodes 

c 

do  2000  inGde»l,  nnode 

c 

c  - loop  over  the  elements  of  this  group 

c 


cdir$  ivdep 
c 

do  3000  ielem»leleO,  ielel 
ipoin=»lconn(ielem,  inode) 

rhspo(ipoin)«rhspo(ipoin)+rhsel(ielem,  inode) 

3000  continue 
c 

2000  continue 
c 

c  - end  of  outer  loop  over  the  groups  of  elements 

c 

1000  continue 


Figure  4.7 


8-24 


-  Set  the  node-array  LI0DE(l:4)=O; 

-  Loop  over  the  sides  of  the  element:  if  the  side 
has  been  marked  for  the  introduction  of  a  new 
point,  set  LIODE(XPl)*l,  L*0DE(IP2)=*1,  where 
IP1,  IP2  are  the  end-nodes  corresponding  of  this 
side; 

-  Loop  over  the  sides  of  the  element:  if  U- 
0DB(XP1)«1  and  L*0DB(IP2)«1,  mark  the  side 
marked  for  the  introduction  of  a  new  point. 

Practical  calculations  with  several  admissible  layers 
of  refinement  and  large  grids  revealed  that  sometimes 
up  to  IS  passes  over  the  mesh  where  required  to  ob¬ 
tain  an  admissible  set  of  sides.  This  relatively  high 
number  of  passes  can  occur  when  the  mesh  exhibits 
regions  were  the  refinement  criterion  is  just  met  by  the 
elements.  Then,  the  list  of  sides  originally  marked  for 
refinement  will  be  far  from  an  admissible  one.  In  each 
pass  over  the  mesh,  a  further  ‘layer’  of  elements  with 
admissible  sides  marked  for  refinement  will  be  added. 
Moreover,  as  an  element  can  be  refined  in  six  possible 
ways,  in  some  cases  it  may  take  three  passes  to  go 
from  a  1:2  to  a  1:8  case.  Thus,  the  ‘front’  of  elements 
with  an  admissible  set  of  sides  marked  for  refinement 
may  advance  slowly,  resulting  in  many  passes  over  the 
mesh.  A  considerable  reduction  in  CPXJ  is  realized  by 
presorting  the  elements  as  follows: 

-  Add  up  all  the  sides  marked  for  refinement  in  an 
element; 

-  If  0,  1  or  6  sides  were  marked:  do  not  consider 
further; 

-  If  4  or  5  sides  were  marked:  mark  all  sides  of  this 
element  to  be  refined; 

•  If  2  or  3  sides  were  marked:  analyze  in  depth  as 
described  above. 

This  then  yields  the  final  set  of  sides  on  which  new 
gridpoints  are  introduced. 

4.4.1  3  Identification  of  Elements  to  be  Deleted 

The  aim  of  this  sub-step  is  to  determine  which  points 
are  to  be  deleted,  so  that  ‘he  resulting  coarsening 
patterns  on  an  element-level  belong  to  the  allowed 
cases  listed  above,  thus  producing  a  compatible,  valid 
new  mesh.  Four  main  steps  are  necessary  to  achieve 
this  goal: 

a)  Mark  elements  to  be  deleted; 

b)  Filter  out  elements  where  father  and  all  sons  are 
to  be  deleted; 

c)  Obtain  preliminary  list  of  points  to  be  deleted; 

d)  Delete  points  from  this  list  until  an  admissible 
coarsening  pattern  is  achieved. 

The  first  two  of  these  steps  are  obvious.  The  Isst  two 
are  explained  in  more  detail. 

c)  Obtain  preliminary  list  of  points  to  be  deleted 
Given  the  list  of  parent-elements  tv  be  coarsened,  one 
can  now  determine  a  preliminary  Pat  of  points  to  be 
deleted.  Thus,  all  the  points  that  would  be  deleted  if 
all  the  elements  contained  in  this  list  were  coarsened 
are  marked  as  ‘total  deletion  points’. 


d)  Delete  points  to  achieve  admissible  coarsening 
The  list  of  total  deletion  points  obtained  in  the  pre¬ 
vious  step  is  only  preliminary,  as  unallowed  coarsen¬ 
ing  esses  may  appear  on  an  element  level.  Therefore 
loops  are  performed  over  the  elements,  deleting  all 
those  total  deletion  points  which  would  result  in  u- 
nallowed  coarsening  cases  for  the  elements  adjoining 
them.  This  process  is  stopped  when  no  incompatible 
total  deletion  points  are  left.  As  before,  this  process 
may  be  made  considerably  foster  by  grouping  togeth¬ 
er  and  treating  differently  the  parent  elements  with 
0,1,2, 3, 4, 5  or  6  total  deletion  points. 

4.4.1.4  Refinement  of  the  Grid  Where  Needed 

The  introduction  of  further  points  and  elements  is 
performed  in  two  independent  steps,  which  in  princi¬ 
ple  could  be  performed  in  parallel. 

a)  Points:  To  add  further  points,  the  sides  marked 
for  refinement  in  sub-step  4.1.2  are  grouped  togeth¬ 
er.  For  each  of  these  sides  a  new  grid-point  will  be 
introduced.  The  interpolation  of  the  coordinates  and 
unknowns  is  then  performed  using  the  side/point  in¬ 
formation  obtained  in  sub-step  4.1.1.  These  new  co¬ 
ordinates  and  unknowns  are  added  to  their  respective 
arrays.  In  the  same  way  new  boundary  condition- 
s  are  introduced  where  required,  and  the  location  of 
new  boundary  points  is  adjusted  using  the  CAD-CAM 
data  defining  the  computational  domain. 

b)  Elements:  In  order  to  add  further  elements,  the 
sides  marked  for  refinement  are  labelled  with  their 
new  gridpoint-nuinber.  Thereafter,  the  element/side 
information  obtained  in  sub-step  4.1.1  above  is  em¬ 
ployed  to  add  the  new  elements.  The  elem<  ^  «.o  be 
refined  are  grouped  together  according  to  e  refine¬ 
ment  cases  shown  in  Figure  4.4.  Each  case  is  .  eated 
in  block  fashion  in  a  separate  subroutine.  Perhaps 
the  major  breakthrough  of  the  present  work  was  the 
reduction  of  the  many  possible  refinement  cases  to 
only  six.  In  order  to  accomplish  this,  some  informa¬ 
tion  for  the  2:8+  and  the  4:8+  cases  is  stored  ahead 
in  scratch  arrays.  After  these  elements  have  been  re¬ 
fined  according  to  the  2:8  and  4:8  cases,  their  sons 
are  screened  for  further  refinement  using  this  infor¬ 
mation.  Ail  sons  that  require  further  refinement  are 
then  grouped  together  as  1:2  or  1:4  cases,  and  pro¬ 
cessed  iu  turn. 

As  the  original  description  of  all  variables  was  per¬ 
formed  using  linear  elements,  the  linear  interpolation 
of  the  unknowns  to  the  new  points  will  be  conserva¬ 
tive.  However,  small  conservation  losses  will  occur 
at  curved  surfaces.  These  losses  are  considered  to  be 
both  unavoidable  and  small. 

4.4.1.5  Goarsening  of  the  Grid  Where  Needed 

The  deletion  of  points  and  elements  is  again  per¬ 
formed  in  two  independent  steps,  which,  in  principle, 
could  be  performed  in  parallel, 
a)  Points:  The  points  to  be  deleted,  having  been 
marked  in  sub-step  4.1.3  above,  all  that  remains  to 
be  done  is  to  fill  up  the  voids  in  the  coordinate-, 


unknown-  and  boundary  condition-arrays  by  renum¬ 
bering  points  and  boundary  conditions, 
b)  Elements:  The  deletion  of  elements  is  again  per¬ 
formed  blockwise,  by  grouping  together  all  elements 
corresponding  to  the  coarsening  cases  shown  in  Fig¬ 
ure  4.6.  Thereafter,  the  elements  are  also  renumbered 
(in  order  to  fill  up  the  gaps  left  by  the  deleted  ele¬ 
ments),  and  the  point-renumbering  is  taken  into  con¬ 
sideration  within  the  connectivity-arrays. 

It  is  clear  that  the  coarsening  procedure  is  non¬ 
conservative.  However,  no  physical  or  numerical 
problems  have  ever  been  observed  by  using  it.  This 
may  be  explained  by  the  fact  that  the  coarsening  is 
done  in  those  regions  where  the  solution  is  smooth. 
Thus,  the  coarsened  grid  represents  the  sdution  very 
well,  and  consequently  the  conservation  losses  are  »- 
mall.  Moreover,  those  regions  where  the  maintenance 
of  conservation  is  important  (e.g.  discontinuities)  are 
never  affected. 

4.5  Adaptive  Remeshfag 

Adaptive  remeshing  is  a  very  competitive  adaptation 
technique  for  steady-state  applications  or  problems 
with  moving  bodies.  For  the  latter  class  of  problem- 
s,  a  fixed  mesh  structure  will,  in  moat  cases,  lead  to 
badly  distorted  elements.  This  means  that  at  least 
a  partial  regeneration  of  the  computational  domain 
is  required.  On  the  other  hand,  as  the  bodies  move 
through  the  flowfield,  the  positions  of  relevant  flow 
features  will  change.  Therefore,  in  most  of  the  com¬ 
putational  domain,  a  new  mesh  distribution  will  be 
required.  The  idea  is  to  regenerate  the  whole  compu¬ 
tational  domain  adaptively,  taking  into  consideration 
the  current  flowfield  solution.  Any  of  the  automat¬ 
ic  grid  generation  techniques  outlined  above  may  be 
employed  to  accomplish  this.  In  my  codes  I  tend  to 
use  the  advancing  front  technique  [Ar.2-4],  The  steps 
required  for  one  adaptive  remeshing  are  as  follows: 

R.l  Obtain  the  error  indicator  matrix  for  the  grid- 
points  of  the  present  grid. 

R.2  Given  the  error  indicator  matrix,  get  the  element 
site,  element  stretching  and  stretching  direction 
for  the  new  grid. 

R.3  Using  the  old  grid  as  the  ‘background  grid’, 
remesh  the  computational  domain  using  the  ad¬ 
vancing  front  technique. 

R.4  If  further  levels  of  global  h-refinement  are  do- 
sired,  refine  the  new  grid  globally. 

R.5  Interpolate  the  solution  from  the  old  grid  to  the 
new  one. 

4.5.1  Local  Remeshiny 

Practical  simulations  indicate  that  the  appearance  of 
badly  distorted  elements  occurs  at  a  frequency  that 
is  much  higher  than  expected  from  the  element  size 
prescribed.  Given  the  relatively  high  cost  of  glob¬ 
al  remeshing,  local  remeshing  in  the  vicinity  of  the 
elements  that  became  too  distorted  becomes  an  at¬ 
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tractive  option.  The  steps  required  are  as  follows: 

L.l  Identify  the  badly  distorted  elements  in  the 
layers  that  move,  writing  them  into  a  list 
LEKEKfl :ISRSM). 

L.2  Add  to  this  list  the  elements  surrounding  these 
badly  distorted  elements. 

L.3  Form  ‘holes’  in  the  present  mesh  by: 

L.3.1  Forming  a  new  background  mesh  with  the 
elements  stored  in  the  list  LEREN  . 

L.3. 2  Deleting  the  elements  stored  in  LEREN  from 
the  current  mesh. 

L.3.3  Removing  all  unused  points  from  the  grid 
thus  obtained. 

L.4  Recompute  the  error  indicators  and  new  clement 
distribution  for  the  background  grid. 

L.5  Regrid  the  ‘holes’  using  the  advancing  front 
method. 

Typically,  only  a  very  small  number  of  elements  (< 
10)  becomes  so  distorted  that  a  remeshing  is  required . 
Thus,  local  remeshing  is  a  very  economical  tool  that 
allowes  reductions  in  CPU-requirements  by  more  than 
60%  for  typical  runs. 

5.  EFFECTIVE  USE  OF 
SUPERCOMPUTER  HARDWARE 

However  clever  an  algorithm  may  be,  it  has  to  run 
efficiently  on  today’s  supercomputer  hardware.  The 
following  section  examines  the  main  issues  involved 
for  each  type  of  supercomputer  currently  available,  as 
well  as  techniques  to  use  this  hardware  as  efficiently 
as  possible.  There  are  three  main  types  of  supercom¬ 
puter  currently  available.  These  are: 

a)  The  by  now  traditional  vector  machines,  which 
achieve  high  speeds  by  splitting  the  necessary  arith¬ 
metic  operations  between  subsequent  members  of  a 
vector- loop, 

b)  Single  Instruction  Multiple  Data  (SIMD)  ma¬ 
chines,  that  perform  the  same  arithmetic  operation 
across  a  large  number  of  low-level  processors,  and 

c)  Multiple  Instruction  Multiple  Data  (MIMD)  ma¬ 
chines,  that  perform  the  different  arithmetic  opera¬ 
tions  across  many  medium-level  processors. 

One  emerging  architecture  for  future  machines  is  the 
NIMD-machine,  where  only  a  few  (N)  different  pro¬ 
cesses  sre  carried  out  over  a  large  number  (M)  of 
powerful  vector  processors.  An  architecture  like  this 
would  require  the  programmer  to  take  into  consider¬ 
ation  all  the  individual  aspects  ecountered  in  each  of 
the  presently  available  supercomputer  architectures. 

5.1  V<?9Wr  Maduna 

At  the  beginning  of  the  1980s,  two  different  classes 
of  vector  machines  where  in  use:  memory  to  memory 
(CYBER-205)  and  register  to  register  (CRAY)  archi¬ 
tectures.  Due  to  its  greater  versatility,  the  second 
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type  of  machine  is  the  dominant  vector  machine  at 
present.  Register  to  register  machines  prefer  chunky 
loops,  high  flop  to  memory  access  ratios,  and  low  in¬ 
direct  addressing.  Unfortunately,  most  simple  flow 
solvers  look  just  the  opposite;  they  have  extremely 
simple  loops,  low  flop  to  memory  access  ratios,  and 
high  amount  of  indirect  addressing.  Just  to  illustrate 
the  importance  of  indirect  addressing,  the  reader  may 
be  reminded  that  even  with  hardware  gather/scatter, 
it  takes  the  equivalent  of  2.5  multiplications  to  get 
a  number  from  memory  using  indirect  addressing. 
Therefore,  an  important  issue  when  coding  for  perfor¬ 
mance  on  these  machines  is  the  reduction  of  indirect 
addressing  operations  required.  For  tetrahedral  ele¬ 
ments,  the  amount  of  indirect  addressing  operations 
required  can  be  approximately  halved  by  going  from 
ar.  element-based  data  structure  to  an  edge-based  da¬ 
ta  structure.  This  change  of  data  structure  avoids 
redundant  information,  leading  also  to  a  proportional 
reduction  in  CPU  and  memory  requirements  [Se.3- 
5].  A  second  important  issue  is  the  vectoriz ability 
of  scatter-add  loops.  Consider  the  following  element 
RHS  assembly  loop; 

Algorithms  to  group  the  elements  into  non¬ 
conflicting  groups  fall  into  the  category  of  colouring 
schemes  [Sc.  1,2].  This  has  been  exemplifled  for  the 
2-D  mesh  shown  in  Figure  5.1.  Given  that  on  typical 
vector-machines  one  only  needs  vector  lengths  that 
are  a  few  multiples  of  64,  it  is  easy  to  construct  very 
well-balanced  element  orderings  that  span  the  whole 
mesh,  except  for  one  last  group  with  less  than  64  el¬ 
ements. 


5.2  SIMP  Machines 

SIMD  machines,  as  exemplified  by  the  Thinking  Ma¬ 
chines  CM-series,  operate  very  efficiently  on  nearcst- 
neighbour  transfer  of  operations.  On  the  other  hand, 
general  data  exchange  operations,  as  required  for  un¬ 
structured  grids,  take  a  very  large  amount  of  time. 
A  simple  gather  takes  the  equivalent  of  20-60  flops. 
Several  routers  or  pre-compilers  have  been  devised  to 
alleviate  this  problem  [Sc6j.  At  the  same  time,  renum¬ 
bering  strategies  have  been  explored  [Sc7].  Both  ap¬ 
proaches  combined  lead  to  a  significant  decrease  in 
indirect  adressing  overhead.  On  the  other  hand,  they 
are  useless  for  more  general  applications  where  the 
mesh  topology  changes  every  few  time9teps  (remeah- 
ing,  h-refinement.  etc.).  Therefore,  only  a  few  steady- 
state  or  fixed  mesh  transient  applications  have  been 
extremely  9uccessfull  on  this  type  of  machine  (Sc.8- 
11].  SIMD  machines  may  be  compared  to  memory  to 
memory  vector  machines:  they  inherently  lack  gener¬ 
ality,  which  may  lead  to  their  eventual  demise. 

5  3  MIMD  Machines 

M1MD  machines,  as  exemplified  by  the  Intel,  NCube, 
Parsytech,  etc.  hypercubes,  consist  of  fairly  powerful 
processors  that  are  linked  together  by  message  pass¬ 
ing  and  synchronization  software.  In  the  future,  each 
of  these  processors  will  be  a  vector-processor.  This 
implies  that  most  of  the  algorithmic  considerations 
discussed  for  vector  machines  will  carry  over  to  these 
machines.  The  main  issues  when  trying  to  code  opti¬ 
mally  for  this  type  of  supercomputer  are: 
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Figure  6.1:  Renumbering  or  Colouring  of  Elements  for  Vectorized  Scatter-Add 


x-r; 


a)  Minimization  of  Idle  Time:  In  the  worst  case  s- 
cenario,  all  but  one  processor  wait  for  the  last 
processor  to  finish  a  certain  task.  This  implies 
that  one  has  to  strive  for  the  same  amount  of 
work  and  communication  in  each  processor. 

b)  Minimization  of  Interprocessor  Information  Flow: 
With  processor  performance  advancing  rapidly  in 
comparison  to  interprocessor  transfer  speeds,  the 
amount  of  information  required  between  proces¬ 
sors  has  to  be  minimized.  In  general,  this  will 
lead  to  a  minimization  problem  for  the  area-to- 
volume  ratio  of  the  processors. 

c)  Extra.  Lavers/ Additional  Work  .Tradeoffo  The 

amount  of  required  information  transfer  between 
processors  can  be  staged  (usual  code),  or  more  in- 
lv.  '■nation  can  be  gathered  ahead  of  the  timestep. 
In  the  latter  case,  no  further  information  transfer 
is  required  within  a  timestep.  On  the  other  hand, 
more  layers  of  information  surrounding  each  do¬ 
main  are  required.  Both  approaches  are  possible, 
and  the  performance  of  each  may  depend  more 
on  the  indiviual  hardware  of  each  machine  than 
anything  else. 

5.3.1  General  Considerations 

T  he  effective  use  of  any  parallel  machine  requires  the 
following  general  steps  for  the  solution  of  the  problem 
at  hand: 

P.  1  Break  up  the  problem  to  be  solved  into  pieces; 
P.2  Hand  each  processor  a  piece  of  the  problem; 

P.3  If  required:  provide  for  interprocessor  transfer  of 
information; 

P.4  Assemble  the  results. 

The  processor  hierarchy  and  scheduling  may  also  vary. 
Some  of  the  possible  choices  are: 

-  All  processors  working  at  the  same  level  (used  for 
grid  smoothing  and  explicit  flow  solvers); 

-  A  pyramid  of  masters  that  hand  out  and/or  per¬ 
form  tasks; 

-  A  one  master  -  many  slaves  doing  different  oper¬ 
ations  hierarchy  (used  for  unstructured  grid  gen¬ 
eration); 

-  A  one  master  -  many  slaves  doing  the  same  op¬ 
eration  hierarchy  (the  SIMD  paradigm). 

Porting  a  typical  flow-code  to  a  MIMD  requires  the 
following  pieces  of  software: 

-  Parallel  Input  Modules; 

-  Parallel  Domain  Subdivision  Modules  for  Load 
Balancing; 

-  Node- Versions  of  the  Flow-Code  for  Parallel  Ex¬ 
ecution; 

-  Interdomain  Info- Transfer  Modules; 

•  Parallel  Adaptive  Grid  Regeneration  Modules; 

•  Parallel  H-refinement  Modules; 

-  Parallel  Output  Modules. 

1  le  message  is  clear:  do  everything  in  parallel,  or 
don’t  even  start. 

5.4.  Domain  Splitting 

The  aim  of  any  domain-splitting  algorithm  devised 


for  parallel  machines  is  to  minimize  the  interprocessor 
transfer  of  information.  For  a  given  information  flux 
fa,  and  subdomains  of  equal  size,  this  is  equivalent 
to  the  minimization  of  the  surface  to  volume  ratio  of 
each  subdomain  »: 

-q—  —  mm  .  (».l) 

Several  algorithms  for  splitting  a  domain  can  be  en¬ 
visioned.  These  are,  in  ascending  order  of  generality 
(see  Figure  5.2): 

•  Simple  Cartesian  splitting; 

-  Quadtree/Octree  splitting; 

-  Unstructured  grid  splitting. 

It  seems  natural  to  choose  the  unstructured  grid  s- 
plitting  for  the  following  reasons: 

-  A  fine  unstructured  grid  can  be  assumed.  This, 
in  turn,  allows  division  of  the  grid  into  subdo¬ 
mains  of  nearly  equal  size. 

-  Three  major  pieces  of  software,  the  grid  s- 
moother,  the  field  solver,  and  the  grid  generator 
can  all  use  the  same  algorithm  to  generate  sub- 
domains.  This  reduces  the  software  development 
costs. 

5.4.1  A  Domain-Splitting  Algorithm 

A  simple  algorithm  that  attempts  to  obtain  a  sub¬ 
division  which  satisfies  Eqn.(5.1)  is  the  following 
‘wavefront’-type  scheme: 

Assume  given:  as  part  of  the  background  grid 

-  The  elements  that  surround  each  point; 

-  An  initial  starting  element  within  each  subdo¬ 
main. 

-  A  real  number  REFFB  in  each  element  associated 
with  the  effort  to  be  spent  in  it  (e.g.  the  number 
of  elements  to  be  created); 

-  A  real  number  KEFFD  that  denotes  the  desired 
average  cumulative  effort  in  each  subdomain. 

Then: 

$.1  Initialize  point  and  element  arrays; 

5.2  Initialize  domain  counter  REF  FT)  ; 

5.3  Start  next  subdomain: 

Update  domain  number; 

Initialize  subdomain  effort  counter; 

5.4  Select  the  next  point  to  be  surrounded  from  the 
order  of  creation  list; 

5.5  Mark  this  point  as  totally  surrounded; 

5.6  For  each  of  the  elements  surrounding  this  point: 

If  the  element  has  not  been  marked  as  be¬ 
longing  to  a  domain  before: 

Mark  the  element  as  belonging  to 
the  present  domain; 

Update  subdomain  effort  counter; 
For  the  nodes  of  this  element: 

If  the  point  is  not  yet  totally 
surrounded  and  has  not 
yet  been  incorporated  into  the 
order  of  creation  list: 
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a)  Cartesian 


b)  Quad-Tree 


c)  Background  Grid 


Add  the  point  to  the  order  of 
creation  list; 

Endif 

Endif 

S.7  If  the  subdomain  effort  counter  exceeds  REFFD 


Figure  5.3  shows  how  this  algorithm  works  on  a  simple 
2-D  domain,  where  it  was  assumed  that  REFFE»1 .0 
and  REFFDaS.O.  For  more  expensive,  but  almost  as 
effective  algorithms,  see  [Mi.  1]. 


Compress  point  creation  list,  eliminating  all 
totally  surrounded  points. 

Goto  S.3 
Otherwise 

Goto  S.4 
Endif 


5.5  Parallel  Grid  Generation 

For  the  grid  generator,  the  steps  taken  are  as  follows: 
G.l  Subdivide  the  global  domain  to  be  gridded  into 
subdomains  using  the  background  grid; 

G.2  Grid  up  each  subdomain  separately; 

G.3  Grid  up  the  inter-subdomain  regions; 

G.4  Assemble  the  result. 


k-2>; 


Figure  5.3;  Simple  Domain  Splitting  Algorithm 
i,j:  i:  order  of  creation,  j:  domain- number 


Given  the  subdomains,  there  are  two  possible  parallel 
grid  generation  strategies  (see  Figure  5.4): 

a)  In-Out: 

-  Grid  each  subdomain  separately; 

-  Grid  pairs  of  adjacent  domains; 

-  Grid  the  corners. 

One  can  also  work  directly  with  corners  after  gridding 
each  subdomain.  However,  it  is  clear  that  the  achiev¬ 
able  parallelism  is  greater  if  they  are  postponed. 

*>) 

-  Grid  the  corners; 

-  Grid  pairs  of  adjacent  domains; 

•  Grid  each  subdomain  separately. 

This  second  approach  avoids  the  need  of  checking  a- 
gainst  the  borders  of  the  subdomain  in  the  last,  and 
most  CPU-intensive  step.  However,  the  correct  grid- 
ding  of  corners  and  lines  without  conflicts  is  more 
difficult  than  in  the  first  method.  In  the  present  case, 
the  first  approach  was  implemented. 

5.5.1  Generation  of  Subdomain-Grids 

In  each  subdomain,  the  advaneing-front  grid  genera¬ 
tion  technique  (Ga.3-7]  is  used  to  generate  an  unstruc¬ 
tured  grid.  Given  a  background  gTid  that  defines  the 
desired  spatial  distribution  of  element  size  and  shape, 
the  following  steps  are  required  (see  Figure  5.5): 

F,1  Find  the  outer  faces  of  the  background  grid. 

F.2  If  no  active,  initial  faces  are  present:  generate  a 


first  element  based  on  the  background  grid.  This 
initial  element  yields  a  first  set  of  active  faces. 

F.3  Select  the  next  available  active  face  to  be  deleted 
from  the  front.  In  order  to  avoid  large  elements 
crossing  over  regions  of  small  elements,  the  face 
forming  the  smallest  new  element  is  selected  as 
the  next  face  to  be  deleted  from  the  list  of  faces. 

F.4  For  the  face  to  be  deleted: 

F.4.1  Select  a  ‘best  point’  position  for  the  intro¬ 
duction  of  a  new  point  IPIEtf. 

F.4. 2  Determine  whether  a  point  exists  in  the  al¬ 
ready  generated  grid  that  should  be  used  in 
lieu  of  the  new  point.  If  there  is  such  a  point, 
set  this  point  to  IPKBV  and  continue  search¬ 
ing  (go  to  F.4. 2). 

F.4.3  Determine  whether  the  element  formed  with 
the  selected  point 

IPIEtf  crosses  any  given  faces.  If  it  does, 
select  a  new  point  as  IPIBW  and  try  again 
(go  to  F.4.3). 

F.4. 4  Determine  whether  the  element  formed  with 
the  selected  point 

IPIEtf  crosses  any  given  background  faces.  If 
it  does,  mark  the  current  face  as  unavailable 
and  select  a  new  face  (go  to  F.8). 

F.5  Add  the  new  element,  point,  and  faces  to  their 
respective  lists. 

F.6  Find  the  generation  parameters  for  the  new  faces 
from  the  background  grid. 

F.7  Delete  the  known  faces  from  the  list  of  faces. 
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Figure  5.4:  Parallel  Grid  Generation  Strategies 
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Fiture  5.5:  Advancing  FVont  Grid  Generation  in 
Each  Subdomain 

F.8  If  tbete  are  any  active,  available  faces  left  in  the 
front,  go  to  F.3. 

Compared  to  the  usual  advancing  front  method,  the 
only  modifications  required  are  steps  F.l,  P.2  and 
F.4.4. 

5.5.2  Data  Management 

5. 5. 2.1  Data  Structures  for  the  Host 

In  order  to  simplify  the  complexity  of  data  handling 
as  much  as  possible,  the  following  rules  were  followed. 

-  A  point  belongs  uniquely  to  one  (and  only  one) 
subdomain; 

-  An  clement  belongs  to  the  lowest  domain-number 
of  its  nodes; 

-  A  face  may  belong  to  more  than  one  domain. 

In  the  global  arrays  that  store  the  mesh  as  it  is  assem¬ 
bled,  the  subdomains  have  allotted  slots.  The  storage 
scheme  used  is  shown  diagrammatically  for  the  coor¬ 
dinates  in  Figure  5.6. 

Oonum-njmOtr  _ _ 

I -  -  m  npoil  I  -  r  npoii  I  ,  .... 


Figure  5.6:  Storage  of  Coordinate  Information  in 
the  Master-Node 

5. 5. 2. 2  Data  Structures  for  the  Node 

No  special  data  structures  are  required  for  the  node 
In  fact,  the  node-code  is  the  same  as  the  one  used  for 
a  single-processor  machine. 

5.5.3  Information  Flow 

The  information  transfer  required  for  the  parallel  grid 
generation  algorithm  described  above  is  as  follows: 

Ftom-the.Hqall<Uti?  Jiadsi 
•  Assemble  the  required  information  for  the  do¬ 
mains  involved: 

•  Background  Grids 


-  Active  Faces 

-  Points  of  these  Faces 

-  Renumber  to  obtain  local  arrays,  remembering 
the  renumbering  order; 

-  Send  the  assembled  information  to  the  node. 
From  the  Node  to  the  Host: 

-  Obtain  the  remaining  information  from  the  node: 

-  Active  Faces 

-  New  Clements 
*  New  Points 

-  Renumber  and  store  in  the  global  arrays. 

When  generating  the  inter-domain  regions,  one  can 
not  generate  at  the  same  time  the  interfaces  of  al- 
1  neighbors  for  a  certain  subdomain.  Therefore,  a 
contingency  list  was  implemented  that  avoids  these 
conflicts.  It  operates  on  the  premise  that  during  each 
tank  hand-out  pass  over  the  interdomain-boundaries 
or  corners,  a  subdomain  may  be  touched  once  only. 
During  each  task  hand-out  pass,  the  following  opera¬ 
tions  are  performed: 

C.l  Initialize  a  subdomain-array  ; 

C.2  Loop  over  the  interdomain-boundaries  (or  cor¬ 
ners): 

if  none  of  the  subdomains  involved  has  al¬ 
ready  been  used: 

-  Mark  the  subdomains  involved; 

-  Renumber  and  send  to  the  next 
available  processor  the 

information  necessary; 

Endif 

5.6  Parallel  Smoothing  and  Explicit  Flow  Solvers 

Practical  implementations  of  cither  advancing  front 
or  Voronoi  grid  generators  indicate  that  in  certain  re¬ 
gions  of  the  mesh  abrupt  variations  in  element  shape 
or  size  may  be  present.  These  variations  appear  even 
when  trying  to  generate  perfectly  uniform  grids.  The 
usual  way  to  circumvent  this  problem  is  to  improve 
the  uniformity  of  the  mesh  by  smoothing.  The  most 
commonly  used  smoother  is  the  so-called  Laplacian 
smoother.  The  sides  of  the  triangulation  are  assumed 
to  represent  springs.  These  springs  are  then  relaxed 
in  time  using  explicit  time  stepping,  until  an  equilibri¬ 
um  of  spring-forces  has  been  established.  The  flow  of 
information  and  most  of  the  loops  in  such  a  smoother 
are  equivalent  to  an  explicit  time-marching  scheme. 
Therefore,  the  techniques  used  for  the  smoother  and 
the  flow  solver  are  identical.  The  implementation  of 
a  spring  analogy  smoother  or  an  explicit  flow  solver 
on  a  parallel  machine  requires  the  following  steps: 

5.1  Subdivide  the  given  mesh  into  subdomains; 

5.2  For  each  smoothing  pass  or  timestep: 

-  Smooth  each  subdomain  separately; 

-  Exchange  boundary  information; 

5.3  Assemble  the  result. 

Thus,  for  the  smoother  and  the  flow  solver  an  equal- 


level  hierarchy  is  employed.  This  seems  appropriate, 
as 

a)  no  contingency  tests  (other  than  the  synchroniza¬ 
tion  at  the  end  of  one  pass  over  the  elements)  are 
required,  and 

b)  the  amount  of  CPU  spent  is  roughly  the  same  in 
each  processor. 

5.6.1  Smoothing  of  Subdomain-Grids 

In  each  subdomain,  the  standard  Laplaciau  smoother 
is  employed.  Each  side  of  the  triangulation  is  sup¬ 
posed  to  represent  a  string.  Thus,  the  force  acting  on 
each  point  is  given  by: 

n*, 

*=<£>, -x.)  •  (5-2) 

;=» 

where  e  denotes  the  spring  constant,  xj  the  coordi¬ 
nates  of  the  point,  and  the  sum  extends  over  all  the 
points  surrounding  the  point.  The  time-advancement 
for  the  coordinates  is  accomplished  as  follows: 

Ax,  =  A<~fj  .  (5  3) 

n*i 

At  the  outer  boundary  of  the  subdomain,  Ax  =  0. 
Usually,  3-d  timesteps  or  passes  over  the  mesh  yield 
an  acceptable  mesh. 

5.6.2  Data  Structures 

In  order  to  be  able  to  also  move  the  boundary  points 
or  change  the  unknowns  at  subdomain  interlaces,  ar, 
exchange  of  information  between  processors  has  to 
be  allowed.  Two  possible  choices  are  possible  (see 
Figure  5  7): 


Assemble  rhs  m  eicb  domain  -  mjo  Oyer  of  elements 

Exchange-ado  rhs  at  interface  -  oooaie  each  domain  as  m  se-ial 

Update  unknowns  ■  Exchange  updated  unknowns 

Figure  5.7:  Updating  Strategies 

a)  Right-Hand  Side  Information: 

-  Assemble  ail  force  contributions  (right-band 
sides)  in  each  processor; 


-  Exchange  force  contributions  between  proces¬ 
sors,  adding; 

-  Update  the  coordinate  vectors  of  the  interfaces. 
The  advantage  of  this  approach  is  that  the  total 
amount  of  information  (elements,  points)  required  for 
the  parallel  version  is  the  same  as  for  the  serial  ver¬ 
sion.  However,  the  original  serial  code  has  to  be  mod¬ 
ified  extensively. 

b)  Coordinate  Information: 

-  Add  a  further  layer  of  elements  to  each  subdo¬ 
main; 

•  Update  each  subdomain  as  for  the  serial,  1- 
domain  case; 

-  Exchange  the  updated  coordinate  information 
between  processors. 

In  this  approach,  the  total  amount  of  information  (ele¬ 
ments,  points)  required  for  the  parallel  version  is  larg¬ 
er  than  that  required  for  the  serial  version.  However, 
the  extra  amount  of  information  is  not  large  ( 1  layer 
of  elements).  On  the  other  hand,  the  code  employed 
in  each  subdomain  is  exactly  the  same  as  for  the  seri¬ 
al  case.  It  was  felt  that  this  advantage  outweighs  all 
possible  disadvantages. 

This  second  method  requires  the  addition  of  layers  of 
elements  at  the  boundaries  of  the  subdomains.  The 
following  algorithm  accomplishes  this  task: 

Assume  given: 

-  The  elements  that  surround  each  point; 

-  The  doinaln-nr.  each  element,  belongs  to. 

-Then; 

L.l  Initialize  pointer-lists  for  elements,  points  and 
receive-list*; 

L.2  For  each  point  IPOII; 

Get  the  smallest  domain-nr.  IDMII  of  the  el¬ 
ements  that  surround  it;  store  this  number  in 
LPMII(XPOII); 

For  each  element  that  surrounds  this  point: 
If  the  domain-nr.  of  this  element  is  larger 
than  IDMII  ; 

-  Add  this  element  to  domain  IDMII 
L.3  For  the  points  of  each  subdomain  IDOMI: 

If  LPNXK IPOII)  .  IS.IDOHI  : 

add  this  information  to  the  receive-list  for 
this  subdomain; 

Endif 

L.4  Order  the  receive-list  of  each  subdomain  accord¬ 
ing  to  subdomains; 

L.5  Given  the  receivc-lists,  build  the  send-list  for 
each  subdomain. 

5.6.3  Information  Flow 

The  information  transfer  required  for  the  parallel  s- 
moothing  or  flow  advancement  algorithm  described 
above  is  as  follows: 

From  the  Host  to  the  None: 


-  Assemble  the  -.required  information  for  the  do- 


mains  involved: 

-  Grid 

-  Send/ Receive  Lists 

-  Send  the  assembled  information  to  the  node. 
From  Node  to  Node: 

-  Send  the  updated  coordinates  of  all  nodes  stored 


in  the  scnd-list; 

Receive  the  updated  coordinates  of  all  nodes  3- 
tored  in  the  teeeive-list; 

Overwrite  the  coordinates  for  these  received 
points. 


Rein  mbcr  and  store  in  the  global  arrays. 
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Multi  Element  Airfoil  Final  Assembled  Mesh 
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Multi  Element  Airfoil:  Smoothed  Mesh 
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Examples  i  Shock  Interaction  with  an  Elevated  Box 

-  Grid  Generator:  FRGEN2D 

-  Hydro-Code:  FEFL027  (2-D,  Euler,  H- Refinement) 

-  Plotting:  FEPL0T2D 

-  Details:  [Ae.6,  Ae.7] 


VIESH:  NELEM=338T«,  NPOIN=13l43 
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MESHt  NELEM-3T80S,  NPOINsl9ll4 


Fig  2.  Expanded  View*  of  (a)  Mesh  and  (b)  Pressure  Con¬ 
tours  Around  the  Box,  tw0.4. 
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Fig  3.  Expanded  Views  of  (a)  Mesh,  (b)  Pressure  Contours 
Around  the  Bax,  and  (c)  Pressure  Contour*  Under  the  Box 

>  — o  *  - - 


Fig  1.  Expanded  views  of  (a)  Mesh  and  (b)  Pressure  Con¬ 
tours  Around  the  Box;  (c)  Mesh  and  (d)  Pressure  Contours 
tor  the  Complete  Computational  Domain.  t«0.0. 
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Shock-Box  Interaction.  Free 


,  ALE-Euler,  Remeshing 


'light  After  Onset  of  Lift 
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t  with  a  Box 
FRGEN2D 


SFL044  (2-D,  AI.E-Euler,  Remeshing) 
OT2D 
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Examples  5 


von  Karman  Vortex  Street 

-  Grid  Generator:  FRGEN2D 

-  Hydro-Code:  FEBIC2D  (2-0,  Semi-Implicit) 

-  Plotting:  FEPLOT2D 

-  Parameters:  Ma=0.1,  Re— 100 

-  Details:  [Hawaii  1980  AIAA  CFD  Conf.) 


MESH  NELEM=  12278  .  NP0IN=  623S 


PRESSURE  MIN=  0.7 IE -*-02  .  MflX=  0.72E*02  .  0UC=  0.5SE- 


ENTR0PY  MIN=  0.42E*0:  .  "A<  =  0.12E+03  ..  0UC=  C. 
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VELOCITY  VECTORS 


von  Kannan  Vortex  Street 

-  Grid  Generator:  FRG15N2D 

-  Hydro-Code:  FEFL0IC2G  (2-D,  Incompressible,  H-ref.) 

-  Plotting:  FEPLOT2D 

-  Parameters:  Ro=1000 

-  Details:  [S1.2] 
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von  K&rmaa  Vortex  Street  at  Re=l,000  (FEFLOIC25) 
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Examples  3  Shock  Interaction  with  an  Idealised  Leopard-2  Tank 

-  Grid  Generator:  ERGEN 3D 

-  Hydro-Code:  PKFL074  (3-D.  ALE-Eulcr.  U-Rof ) 

-  Plotting:  FEPL0T3D,  PEP0ST3D 

-  Details:  (Ae.7,  Ac.9] 


Shock-Tank  Interaction  (FEFL074) 
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Shock  Interaction  with  a  T-62  Tank 

-  Grid  Generator:  FRGEN3D 

-  Hydro-Code:  FEFL074  (3-D,  ALE-Euler,  II-Raf.) 

-  Plotting:  FEPL0T3D,  FEP0ST3D 

-  Details:  [Ae.7,  Ae.10] 


FEFL052 


■V*  •- •%*■*  tu-  •^airar- 


8-55 


REFERENCES 

OVERVIEWS 

Books 

B.l  O.C.  Zienkiewio  and  K.  Morgan  -  Finite  Ele¬ 
ments  and  Approximation;  J.  Wiley  Si  Sons.  An 
excellent  introductory  book,  especially  for  those 
with  finite  difference  background. 

B.2  O.C.  Zienkiewicz  -  The  Finite  Element  Method; 
McGraw  Hill.  The  classic  book  on  Finite  Ele¬ 
ments.  Perhaps  a  bit  heavy  on  examples  from 
the  Swansea  team,  but  this  only  a  minor  flaw. 

B.3  O.  Pironneau  -  Finite  Element  Methods  for  Flu¬ 
ids;  J.  Wiley.  A  good  overview  of  the  field.  Re¬ 
quires  some  background  in  Numerical  Analysis. 

EULER  AND  NAV1ER-ST0KES  SOLVERS 

Euler  Solvers 

Eu.l  A.  Jameson,  W.  Schmidt  and  E.  Turkel-  Numeri¬ 
cal  Solution  of  the  Euler  Equations  by  Finite  Vol¬ 
ume  Methods  using  Runge-Kutta  Time-Stepping 
Schemes;  AIAA-Paper  81-1259  (1981). 

Eu.2  A.  Jameson,  T.J.  Baker  and  N.P.  Weatherhill 
-  Calculation  of  Inviseid  Transonic  Flow  over  a 
Complete  Aircraft;  AIAA-86-0103  (1986). 

Eu.3  P.D.  Lax  and  B.  Wcndroff  -  Systems  of  Conserva¬ 
tion  Laws;  Comm.Pure  Appl.Math.  13,  217-237 
(1960). 

Eu.4  R.W.  MacCormack  -  The  Effect  of  Viscosity  in 
Hypervelocity  Impact  Cratering;  AIAA  69-354 
(1969). 

Eu.5  J .  Donea  -  A  Taylor  Galerkin  Method  for  Con¬ 
vective  Transport 

Problems;  Int.J.Num.Metb.Engng.  20,  101-119 
(1984). 

Eu.6  R.  Lohner,  K.  Morgan  and  O.C.  Zienkiewicz  - 
The  Solution  of  Nonlinear  Systems  of  Hyperbol¬ 
ic  Equations  by  the  Finite  Element  Method;  In- 
l.J.Num.Meth. Fluids  4,  1043-1063  (1984). 

Eu.7  R.  Lohner,  K.  Morgan,  J.  Peraire  and  O.C. 
Zienkiewics  -  Finite  Element  Methods  for  High 
Speed  Flows;  AIAA-85-1531-CP  (1965). 


Fc.l  J.P.  Boris  and  D.L.  Book  -  Flux-corrected  Trans¬ 
port.  I.  SHASTA,  a  Transport  Algorithm  that 
works;  J.Comp.Phys.  11,  38-  (1973). 

Fc.2  D.L.  Book,  J.P.  Boris  and  K.  Hain  -  Flux- 
corrected  Transport.  II.  Generalizations  of  the 
^^^Method^Xjorn^Phys^8^4^G975^^^^^ 


Fc.3  J.P.  Boris  and  D.L.  Book  -  Flux-corrected  Trans¬ 
port.  III.  Minimal-Error  FCT  Algorithms; 
J.Comp.Phys.  20,  397-431  (1976). 

Fc.4  S.T.  Zalesak  -  Fully  Multidimensional  Flux- 
Corrected  Transport  Algorithm  for  Fluids; 
J.Comp.Phys.  31,  335-362  (1979). 

Fc.5  A.K.  Parrott  and  M.A.  Christie  -  FCT  Applied  to 
the  2-D  Finite  Element  Solution  of  Tracer  Trans¬ 
port  by  Single  Phase  Flow  in  a  Porous  Medi¬ 
um;  Proc.  ICFD-Conf.  on  Numerical  Method- 
s  in  Fluid  Dynamics,  Reading,  Academic  Press 
(1986). 

Fc.6  R.  Lohner,  K.  Morgan,  J.  Peraire  and  M.  Vah- 
dati  -  Finite  Element  Flux-Corrected  Transport 
(FEM-FCT)  for  the  Euler  and  Navier-  Stokes  E- 
quations;  ICASE  Rep.  87-4,  Int.J.Num.Meth. 
Fluids  7,  1093-1109(1987). 

Fc.7  R.  Lohner,  K.  Morgan,  M.  Vahdati,  J.P.  Boris 
and  D.L.  Book  -  FEM-FCT:  Combining  Unstruc¬ 
tured  Grids  with  High  Resolution;  Comm.  Appl. 
Num.  Meth.  4,  717-730  (1988). 

TVD  Schemes 

Tv.l  P.  Woodward  and  P.  Colella  -  The  Numeri¬ 
cal  Simulation  of  T  wo-Dimensional  Fluid  Flow 
with  Strong  Shocks;  J.Comp.Phys.  54,  115-173 
(1984). 

Tv.2  P.  Colella  -  Multidimensional  Upwind  Methods 
for  Hyperbolic  Conservation  Laws;  LBL- 17023, 
Preprint  (1983). 

Tv. 3  P.K.  Sweby  High  Resolution  Schemes  Using 
Flux  Limi'..ers  for  Hyperbolic  Conservation  Laws; 
SIAM  J.NumAnal.  21,  995-1011  (1984). 

Tv.4  P.L.  Roe  -  Approximate  Riemann  Solvers, 
Parameter  Vectors  and  Difference  Schemes; 
J.Comp.Phys.  43,  357-372  (1981). 

Tv.5  B.  van  Leer  -  Towards  the  Ultimate  Conser¬ 
vative  Scheme.  II.  Monotonicity  and  Conser¬ 
vation  Combined  in  a  Second  Order  Scheme; 
J.Comp.Phys.  14,  361-370  (1974). 

Tv, 8  A.  Harten  -  High  Resolution  Schemes  for  Hyper¬ 
bolic  Conservation  Laws;  J.Comp.Phys.  49,  357- 
393  (1983). 

Tv.7  S.  Osher  and  F.  Solomon  -  Upwind  Difference 
Schemes  for  Hyperbolic  Systems  of  Co  servation 
Laws;  Math. Comp.  38,  339-374  (1982). 

Tv.8  D.L  Whitaker,  B.  GroeBman  and  R.  Lohner  - 
Two-Dimensional  Euler  Computations  on  a  Tri¬ 
angular  Mesh  Using  an  Upwind,  Finite-Volume 
Scheme;  AIAA-89-0365  (1989). 

Unstructured  Multixrid  Methods 

Mg.l  R.  Lohner  and  K.  Morgan  -  An  Unstructured 

^^MultigridMetho^fo^Ellipti^Problein8^fa- 


1 

( 

» 

s 


I 

* 

! 

i 

i 

I 

1 


* 


v 

i 


t 

c 

V 

Si 

ft 


!' 

i 

i 

i 

i 

j— 


#-56 


t.J.Num.Meth.EDg.  24,  101-115(1987). 

Mg.2  R.  Lohner  and  K.  Morgan  -  Unstructured  Multi¬ 
grid  Methods;  Proc.  of  the  Second  European 
Multigrid  Conf.  (U.  Trottenberg  and  W.  Hack- 
busch  eds.),  GMD-Studien  Nr  110,  GMD,  Sankt 
Augustin  (1986). 

Mg.3  W.  Gentzsch  and  A.  Schluter  -  Uber  ein  Ein- 
schrittverfahren  mit  zyklischer  Schritt- 
weitenanderung  zur  Losung  parabolischer  Differ- 
entialgleichungen;  7, AMM  58,  T415-T416  (1978). 

Mg.4  E.  Perez,  J.  Periaux,  J.P.  Rosenblum,  B.  Stouf- 
flet,  A.  Dervieux  and  M.H.  Lallemand  -  Adap¬ 
tive  Full-Multigrid  Finite  Element  Methods  for 
Solving  the  Two-Dimensional  Euler  Equations; 
Proc.  10th  Int.Conf.Num.Meth.  Fluid  Dynam¬ 
ics,  Peking,  June  1986.  Springer  Verlag. 

Mg. 5  D.J.  Mavriplis  and  A.  Jameson  -  Multigrid  Solu¬ 
tion  of  the  Two-Dimensional  Euler  Equations  on 
Unstructured  Triangular  Meshes;  AIAA  J.  28,  8, 
1415-1425  (1990). 

Snakes  surd  Linelets 

51.1  O.  Hassan,  K.  Morgan,  and  J.  Peraire  -  An  Im¬ 
plicit  Finite  Element  Method  for  High  Speed 
Flows.  AIAA-90-0402  (1990). 

51.2  R.  Lohner  and  D.  Martin  -  An  Implicit,  Linelet- 
Based  Solver  for  Incompressible  Flows;  pp.  9- 
20  in  Advances  in  Finite  Element  Analysis  in 
Fluid  Dynamics  (M.N.  Dhaubhadel,  M.S.  Engel- 
man  and  J.N.  Reddy  eds.),  FED-Vol.  123,  ASME 
(1991). 

MESH  ADAPTATION 

Mesh  Adapts*’*™  bv  Movement 

Am.l  P.  A.  Gnoffo  -  A  Finite- Volume,  Adaptive  Grid 
Algorithm  Applied  to  Planetary  Entry  Flowfield- 
s;  AIAA  J.  21,  1249-1254  (1983). 

Am.2  A.R.  Diaz,  N.  Kikuchi  and  J.E.  Taylor  -  A 
Method  for  Grid  Optimization  for  the  Finite  El¬ 
ement  Method;  Comp.Meth.Appl.Mech.Eng.  41, 
29-45  (1983). 

Am.3  K.  Nakahashi  and  G.  S.  Dtiwert  -  A  Three- 
Dimensional  Adaptive  Grid  Method;  AIAA-85- 
0486  (1985). 

Am. 4  B.  Palmerio  and  A.  Dervieux  -  Application  of  a 
FEM  Moving  Node  Adaptive  Method  to  Accu¬ 
rate  Shock  Capturing;  Proc.  First  Int.  Conf.  on 
Numerical  Grid  Generation  in  CFD,  Landshut, 
W.  Germany,  July  14-17  (1986). 

Mesh  Adaptation  bv  Enrichment 

Ae.l  W.  Schonauer,  K.  Raith  and  K.  Glotz  -  The  Prin¬ 


ciple  of  Difference  Quotients  as  a  Key  to  the  Self- 
Adaptive  Solution  of  Nonlinear  Partial  Differen¬ 
tial  Equations;  Comp.Meth.Appl.Mech.Eng.  28, 
327-359  (1981) 

Ae.2  M.J.  Berger  and  J.  Oliger  -  Adaptive  Mesh  Re¬ 
finement  for  Hyperbolic  Partial  Differential  E- 
quations;  J.Comp.Phya.  53,484-512  (1984). 

Ae.3  J.F.  Dannenhoffer  and  J.R.  Baron  -  Robust  Grid 
adaptation  for  Complex  Transonic  Flows;  AIAA- 
86-0495  (1986). 

Ae.4  B.  Palmerio,  V.  Bitley,  A.  Dervieux  and  J.  Peri¬ 
aux  -  Self-Adaptive  Mesh  Refinements  and  Finite 
Element  Methods  for  Solving  the  Euler  Equa¬ 
tions;  Proc.  ICFD  Conf.  on  Numerical  Methods 
for  Fluid  Dynamics,  Reading,  U.K.,  March  1985. 

Ae.5  J.T.  Oden,  P.  Devloo  and  T.  Strouboulis  -  Adap¬ 
tive  Finite  Element  Methods  for  the  Analysis  of 
Inviscid  Compressible  Flow:  I.  Fast  Refinemcn- 
t/Unrefinement  and  Moving  Mesh  Methods  for 
Unstructured  Meshes;  Comp.  Meth.  Appl.  Mech. 
Eng.  59,327-362(1986). 

Ae.6  R.  Lohner  -  An  Adaptive  Finite  Element 
Scheme  for  Transient  Problems  in  CFD;  Com¬ 
p.Meth.Appl.Mech.Eng.  61,  323-338  (1987). 

Ae.7  R.  Lohner  -  Adaptive  H-Refinement  on  3-D  Un¬ 
structured  Grids  for  Transient  Problems;  AIAA- 
89-0365  (1989). 

Ae.8  J.D.  Baum  and  R.  Lohner  -  Numerical  Simu¬ 
lation  of  Shock- Elevated  Box  Interaction  Using 
an  Adaptive  Finite  Element  Shock  Capturing 
Scheme;  AIAA-89-0653  (1989). 

Ae.9  R.  Lohner  and  J.D.  Baum  -  Numerical  Simula¬ 
tion  of  Shock  Interaction  with  Complex  Geom¬ 
etry  Three-Dimensional  Structures  using  a  New 
Adaptive  H-Refinement  Scheme  on  Unstructured 
Grids;  AIAA-90-0700  (1990). 

Ae.10  J.D.  Baum  and  R.  Lohner  -  Numerical  Simula¬ 
tion  of  Shock  Interaction  with  a  Modern  Main 
Battlefield  Tank;  AIAA-91-1666  (1991). 

Ae.ll  D.J.  Mavriplis  -  Unstructured  and  Adaptive 
Mesh  Generation  for  High  Reynolds-Number 
Viscous  Flows;  pp.  79-91  in  Proc.  3rd  Int.  Conf. 
Num.  Grid  Gen.  in  CFD  (Sengupta  et  al.  eds), 
Pineridge  Press  (1988). 

Ae.12  D.J.  Mavriplis  -  Euler  and  Navier- Stokes  Compu¬ 
tations  for  Two-Dimensional  Geometries  Using 
Unstructured  Meshes;  ICASE  Rep.  90-3  (1990). 

Ae.13  O.C.  Zienkiewicz,  J.P.  de  S.R.  Gago  and  D.W. 
Kelly  -  The  Hierarchical  Concept  in  Finite  Ele¬ 
ment  Analysis;  Comp.Struct.  16,  53-65  (1983). 

Ae.14  I.  Babuska,  J.  Chandra  and  J.E.  Flaherty  (eds.)  • 
Adaptive  Computational  Methods  for  Partial  D- 
ifferential  Equations;  SIAM  Philadelphia  (1983). 


M_esh  Adaptation  bv  Remeshing 

Ar.l  J.  Peraire,  M,  Vahdati,  K.  Morgan  and  O.C. 
Zienkiewicz  -  Adaptive  Remeshing  for  Compress¬ 
ible  Flow  Computations;  J.  Comp.  Phys.  72, 
449-466  (1987). 

Ar.2  R,  Lohner  -  Adaptive  Remeshing  for  Transient 
Problems;  Comp.  Meth.  A ppl.  Mech.  Eng.  75, 
195-214  (1989). 

Ar.3  R.  Lohner  -  An  Adaptive  Finite  Element  Solver 
for  Transient  Problems  with  Moving  Bodies; 
Comp.Struct.  30,  303-317  (1988). 

Ar.4  R.  Lohner  -  Three-Dimensional  Fluid-Structure 
Interaction  Using  a  Finite  Element  Solver  and 
Adaptive  Remeshing;  Computer  Systems  in  En¬ 
gineering  1,  2-4,  257-272  (1990). 

Ar.5  R.  Tilch  -  PhD  Thesis,  CERFACS  (1991). 

GRID  GENERATION 

Books 

Gb.l  J.F.  Thompson,  Z.U.A.  Warsi  and  C.W.  Mastin 
-  Numerical  Grid  Generation:  Foundations  and 
Applications;  North  Holland  (1985). 

Gb.2  P.L.  George  -  Automatic  Mesh  Generation;  J. 
Wiley  &  Sons  (1991). 

Grid  Generation:  Domain  Definition  bv  Digitization 

Gd.l  M.  Merriam  and  T.  Barth  -  3D  CFD  in  a 
Day:  The  Laser  Digitizer  Project;  AIAA-91-1654 
(1991). 

Grid  Generation:  Advancing  FYont  Approach 

Ga.l  N.  van  Phai  -  Automatic  Mesh  generation  with 
Tetrahedron  Elements;  Ir.t.J. Sum. Meth. Eng 
18,237-289  (1982) 

Ga.2  S.H.  Lo  -  A  New  Mesh  Generation  Scheme  for  Ar¬ 
bitrary  Planar  Domains;  Int.J.Num.Meth.Eng. 
21, 1403-1426  (1985). 

Ga.3  J.  Perjure,  M.  Vahdati,  K.  Morgan  and  O.C. 
Zienkiewicz  -  Adaptive  Remeshing  for  Compress¬ 
ible  Flow  Computations;  J.  Comp.  Pbys.  72, 
449-466  (1987). 

Ga.4  R.  Lohner  -  Some  Useful  Data  Structures  for 
the  Generation  of  Unstructured  Grids;  Com- 
m.Appl.Num.Meth.  4,  123-135  (1988). 

Ga.5  R.  Lohner  and  P,  Parikh  -  Three-Dimensional 
Grid  Generation  by  the  Advancing  Front 
Method;  Int.J.Num.Meth.  Fluids  8,  1135-1149 
(1988). 

Ga.6  J.  Peraire,  K.  Morgan,  and  J.  Peiro  -  Un¬ 
structured  Finite  Element  Mesh  Generation  and 
Adaptive  Procedures  for  CFD.  AGARD-CF-464, 
18(1990). 


Ga.7  R.  Lohner,  J.  Camberos  and  M.  Merriam  •  Par¬ 
allel  Unstructured  Grid  Generation;  to  appear  in 
Comp.Mctb.Appl.Mecb.Eng.  (1992). 

Ga.8  F.  Uuet  -  Generation  de  Maillage  Automatique 
dans  les  Configurations  Tridimensionelles  Com¬ 
plexes  Utilization  d’une  Methode  de  ‘Front’; 
AGARD-CP-464,  17  (1990). 

Grid -Generation:  Voronoi  Approach 

Gv.l  J.C.  Cavendish  -  Automatic  Triangulation  of 
Arbitrary  Planar  Domains  for  the  Finite  Ele¬ 
ment  Method;  lat.J.Num.Metb.Eng.  8,  679-696 
(1974). 

Gv.2  D.T.  Lee  and  B.J.  Sehachter  -  Two  Algorithms 
for  Constructing  a  Delaunay  Triangulation;  In- 
t.J.Comp.Inf.Se.  9,3,  219-242  (1980). 

Gv.3  A.  Bowyer  -  Computing  Dirichlet  Tesseiations; 
The  Computer  Journal  24,2, 162-167  (1981). 

Gv.4  D.F.  Watson 

-  Computing  the  N-Dimensional  Delaunay  Tes- 
selation  with  Application  to  Voronoi  Poly  topes; 
The  Computer  Journal  24,2, 167-172  (1981). 

Gv.5  M.  Tanemura,  T.  Ogawa  and  N.  Ogita  -  A  New 
Algorithm  for  Three-  Dimensional  Voronoi  Tea- 
selation;  J.Comp.Pbys.  51,  191-207  (1983). 

Gv.6  S.W.  Sloan  and  G.T.  Houlsby  -  An  Imple¬ 
mentation  of  Watson’s  Algorithm  for  Comput¬ 
ing  2- Dimensional  Delaunay  Triangulations;  Ad- 
v.Eng.Soflwaie  6,4, 192-197  (1984). 

Gv,7  R.C.  Kirkpatrick  -  Nearest  Neighbor  Algorithm; 
pp.302-309  in  Springer  Lecture  Notes  in  Physics 
238  (M.J.  Fritta,  W.P.  Crowley  and  H.  Ttease 
eds.),  Springer  Verlag  (1985). 

Gv.8  D.N.  Shenton  and  Z.J.  Cendea  -  Three- 
Dimensional  Finite  Element  Mesh  Generation 
Using  Delaunay  Tesselation;  IEEE  Trans,  on 
Magnetics,  MAG-21,  2535-2538  (1985). 

Gv.9  J.C.  Cavendish,  D.A.  Field  and  W.H.  Frey  -  An 
Approach  to  Automatic  Three-Dimensional  Fi¬ 
nite  Element  Generation;  lot.  J.  Num.  Meth. 
Eng.  21,  329-347  (1985). 

Gv.10  A.  Jameson,  T.J.  Baker  and  N.P.  Weatherhill 
•  Calculation  of  Inviscid  Transonic  Flow  over  a 
Complete  Aircraft;  AIAA-86-0103  (1986). 

Gv.ll  D.G.  Holmes  and  D.D.  Snyder  -  The  Genera¬ 
tion  of  Unstructured  Triangular  Meshes  Using 
Delaunay  Triangulation;  pp.  643-652  in  Numeri¬ 
cal  Grid  Generation  in  Computational  Fluid  Dy¬ 
namics  (Sengupta  et  al.  eds.  ),  Pineridge  Press 
(1988). 

Gv.12  T.J.  Baker  -  Developments  and  'Rends  in  Three- 
Dimensional  Mesh  Generation.  Appi.  Num. 
Math.  5,  275-304  (1989). 


K-5K 


Grid  Generation:  Modified  Octree  Approach 

Go.l  M.A.  Yerry  and  M.S. 

Shepard  -  Automatic  Three* Dimensional  Mesh 
Generation  by  the  Modified-Octree  Technique; 
l nt.J.Num.Meth.Eng,  20,  1965*1990  (1984). 

Grid  Generation:  Macro  Element  Approach 

Gm.l  O.C.  Zienkiewicz  and  D.V.  Phillips  -  An  Auto¬ 
matic  Mesh  Generation  Scheme  for  Plane  and 
Curved  Surfaces  by  Isoparametric  Co-ordinates; 
IntJ.Num.Meth.Eng.  3,  519-528  (1971). 

Gm.2  J.F.  Thompson  -  A  Composite  Grid  Genera¬ 
tion  Code  for  General  3S  Regions  -  The  EAGLE 
Code;  AL4A  J.  26,  3,  271ff  (1988). 

Gm.3  J.P.  Steinbrenner,  J.R.  Chawner  and  C.L.  Fouts 
-  A  Structured  Approach  to  Interactive  Multi¬ 
ple  Block  Grid  Generation;  AGARD-CP-464,  8 
(1990). 

Gm.5  S.  Allwright  -  Multiblock  Topology  Specification 
and  Grid  Generation  for  Complete  Aircraft  Con¬ 
figurations;  AGARD-CP-464,  11  (1990). 

Grid  Generation:  Ftom  Regular  Background  Grid 

Gr.l  W.C.  Thacker,  A.  Gonzalez  and  G.E.  Putland  - 
A  Method  for  Automating  the  Construction  of 

Irregular  Computational  Grids  for  Storm  Surge 
Forecast  Models;  J.Comp.Phys.  37,  371-387 
(1980). 

Gr-2  D.G.  Holmes  and  S.C.  Lamson  -  Compressible 
Flow  Solutions  on  Adaptive  Triangular  Meshes; 
Open  Forum  AIAA  -  Reno’86  -  Meeting  (1980). 

Data  Structures.  Algorithms  and  Programming 

Techniques 

Ds.l  D.  E.  Knuth,  “The  Art  of  Computer  Program¬ 
ming,*’  Vola.  1-3  (Addison- Wesley,  Reading,  MA, 
1973). 

Ds.2  R.  Sedgewick  -  Algorithms  ;  Addison- Wesley 
(1983). 

Ds.3  J.W .J.  Williams-  Heapsort;  Comm. ACM  7,  347- 
348  (1964). 

Ds.4  R.W.  Floyd  -  Tteesort  3;  Comm.ACM  7,  701 
(1964). 


EFFECTIVE  USE  OF  SUPERCOMPUTER 
HARDWARE 


Scatter-add 

Sc.l  R.  Diekkiimper  -  Vectorised  Finite  Element 
Analysis  of  Nonlinear  Problems  in  Structural  Dy¬ 
namics;  pp. 293-298  in  Proc  Parallel  Computing 
*83  (M.  Feilmeicr,  G.  Joubert  and  U.  Schendcl 
eds.),  North  Holland  (1984). 

Sc. 2  R.  Lohner  and  K.  Morgan  *  Finite  Element  Meth¬ 
ods  on  Supercomputers:  The’  Scatter  Problem; 
Proc.  NU META ’85  Conf.  (3.  Middleton  and  G. 
Pande  eds.),  987-990,  A.A.  Balkema,  Rotterdam, 
1985. 

Sc. 3  S1NDA  Thermal  Analyzer  Manual  (1980) 

Sc .4  D.J.  Mavriplis  -  Three-Dimensional  Unstruc¬ 
tured  Multigrid  for  the  Euler  Equations;  AIAA- 
91-1549  (1991). 

Sc. 5  J.  Peraire,  J.  Peiro  and  K.  Morgan  -  A  3-D  Finite 
Element  Multigrid  Solver  for  the  Euler  Equation- 
si  AIAA-92-0449  (1992). 


Connection  Machine  Issues  and  Anllication9 

Cm.l  CM  routers  NEED  REF 

Cm.2  CM  renumbering  schemes  NEED  REF 

Cm.3  D.  Jespersen  and  C.  Levit  -  A  Computational 
Fluid  Dynamics  Algorithm  on.  a  Massively  Par¬ 
allel  Machine;  AIAA-89-1936-CP  (1989). 

Cm.4  L.N.  Long,  M.M.S.  Khan  and  H.T.  Sharp 
-  A  Massively  Parallel  Three-Dimensional 
Euler/Navier-Stokes  Method;  AIAA-89-1937-CP 
(1989). 

Cm.5  E.S.  Oran,  J.P.  Boris,  and  E.F.  Brown  - 
Fluid- Dynamic  Computations  on  a  Connection 
Machine  -  Preliminary  Timings  and  Complex 
Boundary  Conditions;  AIAA-90-0335  (1990). 

Cm.6  CM  application  NEED  REF 

MlMfl  Machines:  Issues  and  Apllications 

Mi.l  H.  Simon  -  Partitioning  of  Unstructured  Mesh 
Problems  for  Parallel  Processing;  Proc.  Conf. 
Parallel  Meth.  on  Large  Scale  Structural  Analy¬ 
sis  and  Physics  Appl.,  Pergamon  Press  (1991). 


y»  i 


A  Frontal  Approach  for  Node  Generation 
in  Delaunay  Triangulations. 

J.-D.  Muller  * 

I\L.  Roc  ' 

H.  Doconinck  I 

CFD-Group  •  von  Kantian  Institute  fot  Fluid  Dynamics 
Aerospace  Department  -  University  of  Michigan 


1  SUMMARY 

A  new  algorithm  for  the  generation  of  the  interior  nodes 
for  Delaunay  triangulations  is  given.  The  method  uses  a 
background  grid  to  interpolate  local  mesh  size  parar.tetets 
that  is  taken  from  the  ttiangulation  of  the  given  boundary 
nodes  .  Geometric  criteria  ate  used  to  find  a  set  of  nodes 
in  a  frontal  manner.  This  set  is  subsequently  introduced 
into  the  existing  mesh,  thus  providing  an  updated  De¬ 
launay  triangulation.  The  ptoccdiirc  is  completed  when 
no  more  improvement  of  the  grid  by  inserting  new  node* 
can  be  achieved. 

2  INTRODUCTION 

The  purpose  of  this  lecture  series  is  to  discuss  the  achieve¬ 
ments  and  recent  advances  made  in  the  solution  of 
the  Viulcr-  and  Navier-Stokes  equations  on  unstructured 
grids.  One  of  the  main  benefits  related  to  the  unstruc¬ 
tured  grid  approach  is  the  simplicity  with  which  an  un¬ 
structured  grid  can  be  tailored  around  very  complex  ge¬ 
ometries  and  be  adapted  to  the  solution  in  a  subsequent 
interaction  process  with  the  flow  solver.  Such  advantage 
will  become  even  mote  important  as  three-dimensional 
calculations  become  commonplace. 

It  is  desirable  that  the  inesli  can  be  generated  with  min¬ 
imum  input  from  the  user.  Ideally,  one  would  wish  to 
specify  only  the  boundary  geometry,  and  perhaps  a  func¬ 
tion  to  specify  some  desired  mesh  size.  The  internal  point 
cloud  should  then  be  found  automatically  by  the  grid 
generation  code.  Two  methods  that  satisfy  these  criteria 
have  become  popular  and  are  discussed  in  other  lecture 
series  contributions  : 

•  The  Advancing  Front  method  of  Peraire  et  al.  [1], 
detailed  in  the  contributions  of  Lohner  and  Morgan 
Ar  Peraitc. 

•  Holmes'  refinement  procedure  [2)  used  in  a  Delau¬ 
nay  triangulation  [3]  as  discussed  in  the  lectures  by 
Barth. 

The  approach  we  propose  here  combines  ideas  from  both 
methods. 

In  itself  Delaunay  triangulation  docs  not  provide  an  in¬ 
terior  point  cloud.  A  popular  approach  to  overcome  this 
deficiency  has  been  outlined  by  Holmes  [2].  A  Delau¬ 
nay  triangulation  of  the  boundary  nodes  is  taken  as  an 
initial  grid.  Figure  1  gives  such  a  triangulation  for  the 
three  element  aerofoil  configuration  shown  in  figure  II. 
The  initial  triangulatiou  consists  of  large,  very  skewed 
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triangles  that  ate  found  to  exceed  certain  thresholds  of 
maximum  area  or  maximum  skewness.  Holmes  proposes 
to  measure  skewness  as  the  ratio  of  the  radius  of  the  cir¬ 
cumscribed  circle  over  twice  the  radius  of  the  inscribed 
circle. 

Once  such  a  bad  triangle  is  detected  it  will  be  refined 
by  the  insertion  of  a  new  node  at  the  circiuncentrc  of 
that  triangle.  Refinement  is  performed  on  the  largest 
triangle  in  the  grid  until  all  triangles  ate  smaller  than 
a  first  threshold  value.  Then  refinement  continues  on 
the  skewed  triangles  starting  with  the  one  having  the 
largest  circumcirclc.  The  final  grid  is  obtained  after  all 
skewed  triangles  arc  smaller  than  a  second  area  threshold. 
Refining  on  skewness  yields  an  implicit  mechanism  that 
increases  the  node  density  very  close  to  boundaries  with 
a  finer  node  spacing. 

However,  searching  fot  the  largest  cell  or  a  skewed  cell 
with  the  largest  circumcirclc  for  each  new  node  is  a  rather 
costly  procedure  and  the  skewness  criterion  »■.  expensive 
to  evaluate  since  it  involves  three  square  roots  dining  the 
calculation  of  the  circle  ratio.  Also  the  grids  produced  by 
this  approach  are  not  as  regular  as  the  ones  given  by  the 
Advancing  Front  method.  This  is  due  to  the  fact  that 
the  icfincmcnt  process  is  much  more  random. 

3  FRONTAL  NODE  CREATION 

We  introduce  here  a  technique  that  combines  the  ideas  of 
refining  a  Delaunay  triangulatiou  of  boundary  nodes  with 
the  ideas  of  frontal  advance.  In  owr  method,  the  front 
will  take  the  form  of  a  boundary  between  a  ‘nicely’  tri¬ 
angulated  region  and  a  'badly'  triangulated  region.  The 
method  will  be  described  for  two  dimensions,  but  we  be¬ 
lieve  there  will  be  few  obstacles  to  implementing  it  in 
three. 

It  one  looks  at  a  Delaunay  triangulatiou  of  a  set  of  bound¬ 
ary  nodes  (as  in  figure  1),  one  can  observe  that  almost 
every  boundary  face  is  either  the  short  face  of  a  triangle 
with  one  very  acute  angle  or  else  one  of  two  short  faces 
in  a  triangle  with  one  very  obutsc  angle.  In  accordance 
with  our  idea  of  a  front  that  divides  nice  triangles  from 
bad  triangles,  we  take  the  boundary  to  define  the  initial 
position  of  such  a  front,  'l'o  begin  with,  we  have  no  nice 
triangles,  but  we  will  introduce  a  layer  of  well-positioned 
nodes  that  will  allow  i  lie  front  to  advance. 

'I'he  construction  of  the  new  nodes  is  an  easy  task.  We 
simply  form  an  equilateral  triangle  with  the  frontal  face 
and  either  stretch  or  compress  it  to  better  match  the 
spacing  requirements  of  the  background  mesh,  this  gives 
idea)  spacing  with  the  two  nodes  that  form  the  face,  A 
further  check  is  required  to  make  sure  the  new  node  is 
sufficiently  distant  from  the  remaining  nodes  in  the  grid 
and  from  the  other  new  nodes.  The  desired  distance  is  in- 


I'ig.  I;  Triangulation  of  boundary  nodes.  The  nodes 
on  the  boundary  of  a  three  element  aerofoil 
arc  connected  to  nodes  on  the  outer  boundary 
(not  shown),  nodes  on  another  boundary  >>r 
nodes  on  the  same  boundary. 


terpolatcd  on  a  background  mesh  that  uniquely  specifics 
the  distance  between  nodes  everywhere  in  the  domain. 
Nodes  that  exhibit  bad  spacing  arc  either  merged  with 
other  nodes  or  discarded.  With  these  new  nodes  in  place, 
the  Delaunay  algorithm  is  re-run  and  will  readily  accept 
the  proposed,  nice  triangles  as  it  resents  skewness  in  Us 
triangulation.  We  will  name  these  triangles  ‘explicit’  in 
the  following.  Also  nice  triangles  between  the  new  nodes 
will  be  formed  as  they  arc  guaranteed  to  be  spaced  nicely 
as  well,  the  ‘implicit’  triangles.  In  the  rest  of  the  domain 
Delaunay  still  has  to  construct  acute  cells,  though  with 
slightly  improved  shape. 


Fig.  2:  Explicit  triangles  (striped)  and  implicit  trian¬ 
gles  (squared)  that  are  formed  along  the  old 
front  and  build  the  new  front. 

Again  the  short  faces  of  these  acute  cells  denote  a  frontier 
between  the  region  with  nice  cells  and  the  region  still 
waiting  to  be  refined  figu'i  2,  and  the  process  can  be 
repeated  until  all  bad  triangles  have  vanished.  Hence, 
the  algorithm  can  i>e  cast  intr  the  following  steps: 


1.  detect  all  bad  triangles  in  the  grid  and  find  their 
short  faces, 

2.  find  a  set  of  nodes  to  form  nice  triangles  with  the 
short  faces, 

3.  check  whether  the  new  nodes  arc  not  too  close  to  any 
other  node  already  introduced  into  the  structure, 

4.  check  whether  the  new  nodes  are  not  too  close  to 
any  other  new  node, 

5.  rctrianguUtc  with  the  set  of  new  nodes. 

The  steps  will  be  repeated  until  no  w  o  improvement  by 
node  insertion  c«vs  be  achieved. 

3.1  Front  Detection. 

The  front  consists  of  the  interface  between  the  region  of 
properly  refined  triangles  and  the  unrefined  region.  A 
refinement  should  only  take  place  on  a  face  that  has  a 
refutable  triangle  on  one  aide  and  an  unrcfinablc  on  the 
other.  If  refining  was  merely  based  on  side  ratios,  an  ob¬ 
tuse  triangle  in  the  front  would  lead  to  the  introduction  of 
three  nodes.  Figure  3  shows  the  two  nodes  that  would  be 
formed  from  the  two  short  faces  in  the  front  of  the  obtuse 
triangle  and  the  node  from  the  face  of  the  acute  triangle 
that  neighbors  the  obtuse  one.  Not  that  this  extra  node 
is  badly  placed,  but  this  node  should  be  formed  only  in 
the  next  stage  This  would  lead  to  an  irregular  front  with 
scattered  faces  'hat  may  not  be  connnectcd  and  the  sub¬ 
sequent  refining  would  have  much  of  the  randomness  of 
Holmes’  method. 


Fig.  3:  Obtuse  triangle  along  the  front  with  three 
new  nodes  formed. 

A  triangle  is  unrcfinable  if  cither  it  is  not  skewed  or  it  is 
skewed  but  node  spacing  around  the  cell  docs  not  allow 
further  insertion.  Checking  is  simplified  by  keeping  a 
status  flag  for  each  triangle  to  only  examine  each  cell 
once.  It  is  to  be  emphasized  that  contrary  to  the  usual 
Advancing  Front  method  no  explicit  tracking  of  the  front 
ami  thus  no  expensive  overhead  is  required. 

3.2  Node  Construction. 

The  idea)  node  to  be.  placed  in  the  mesh  would  satisfy 
the  distance  criterion  with  all  neighboring  nodes,  i.e.  the 
distance  to  all  nodes  that  it  will  be  connected  to  equals 
the  background  spacing  evaluated  at  the  midpoint  be¬ 
tween  these  two.  Clearly,  this  is  an  ill-posed  problem. 
Dut  even  trying  to  satisfy  the  distance  condition  with 
the  two  nodes  of  the  frontal  face  leads  to  a  system  of 
two  quadratic  equations.  The  task  will  become  more 
amenable  with  the  restriction  to  isocoles  triangles.  We 


wilt  carry  out  the  geometrical  construction  in  an  approx¬ 
imate  manner  leading  to  only  one  lin*ai  equation. 
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Fig.  5:  Scalar  product  criterion  to  walk  from  a  cell 
towards  a  target 


Fig.  4:  Short  face  and  constructed  node  The  new 
node  3  ami  the  spacing  reference  node  4  lie 
on  the  perpendicular  bisector  of  the  face  3. 

We  approximate  the  length  of  the  sides  (1  and  12  opposite 
nodes  1  and  2  by  2/-y/3<  of  the  height  l  as  found  in  an 
equilateral  triangle.  Requiting  that  this  approximated 
sidclciigth  equals  the  desired  spacing  hi  midway  between 
node  3  ami  the  midpoint  of  the  base  M  (figure  4),  we 
find 


In 


-^=|x3  -  xv|  =  (xj  -  xs<)  V/»  -f 


where  h,\i  denotes  the  desired  spacing  at  M,  Vh  is  the 
local  gradient  of  the  background  spacing  and  Xa,x,w  are 
the  position  vectors  of  nodes  3  and  M .  As  we  place  the 
new  node  along  the  perpendicular  bisector,  we  can  write 


at  each  step  a  finite  distance  towards  our  point  of  destiny 
we  must  reach  our  target  so  this  search  procedure  always 
converges. 

The  direction  to  turn  to  is  given  by  the  maximum  scalar 
product  of  the  normal  on  the  face  and  the  vector  from  the 
midpoint  of  that  face  to  the  target  (figure  5).  Of  course, 
only  two  directions  have  to  be  tested  once  the  sv  ;.ch  is 
on  its  way,  also  it  would  be  rather  wasteful  to  use  unit 
normals.  The  search  is  finished  when  all  scalar  products 
arc  non-positive,  indicating  that  the  target  lies  either  in 
the  cell  or  on  a  face  of  the  cell. 

In  the  worst  ease  the  cost  of  this  search  is  0(\/7V)  on 
a  mesh  with  N  nodes.  However,  once  the  foreground 
and  background  cells  associated  with  the  new  node  are 
determined,  all  of  the  triangles  in  the  vicinity,  where  most 
of  the  remaining  operations  take  place,  arc  found  in  a  few 
steps. 


,  X3  -  X,\f  , 

x3  -  x»(  ■-  — - 7  =  hta. 

|Xj  -  X.lf| 

where  n.i  is  the  unit  normal  on  the  base  pointing  towards 
the  triangle  to  be  refined.  The  height  for  a  triangle  with 
counterclockwise  sense  is  thus 

.  _  _ h  ,\1 

*  &  " 

Note  iliat  in  the  given  form  the  altitude  of  the  explicit 
triangle  is  independent  of  the  length  of  the  base  This 
conserves  lliv.  thickness  of  the  layer  of  cells  introduced 
even  if  the  length  of  the  faces  varies  strongly  (figure  9). 

3.3  Searching. 

'The  efficiency  of  unstructured  grid  generation  methods 
is  very  dependent  on  the  way  specific  nodes  or  cells  arc 
found  in  the  grid,  for  example  Bowyer’s  algorithm  re¬ 
quires  the  search  for  a  circiinicircle  that  contains  a  new 
node  ami  interpolation  on  a  background  mesh  involves 
finding  the  background  cell  that  contains  the  point  of  in¬ 
terest.  As  already  stated,  an  implementation  of  Bowyer’s 
algorithm  usually  comes  with  the  storage  of  the  neigh¬ 
bors  to  each  triangle  and  the  position  of  its  citcumcen- 
tre.  Hence,  a  straightforward  way  to  locate  a  position 
in  a  Delaunay  triangulation  wo»>U  be  to  walk  along  the 
Dirichlet  tessclation  from  circuinccntrc  to  circumcciitre 
closer  towards  the  target.  But  this  method  does  not  nec¬ 
essarily  converge  as  a  Voronoi  vertex  can  lie  outside  the 
its  associated  triangle. 

A  method  of  similar  computational  cost  is  to  walk  from 
cell  to  cell  in  the  direction  of  the  target.  As  we  progress 


3.4  Background  Mesh. 

The  Delaunay  tnangiilatioii  of  all  boundary  nodes  is  com¬ 
puted  as  an  initial  triangulation  to  begin  the  point  gen¬ 
eration  process.  This  triangiilution  provides  at  no  extra 
cost  a  suitable  background  mesh  to  provide  a  local  value 
of  desired  distance  between  nodes  at  any  location  within 
the  convex  hull.  It  will  be  assumed  here  that  this  desir¬ 
able  distance-  is  a  piecewise  linear  function  of  position, 
interpolated  between  the  nodal  values  of  a  triangle  in 
the  background  grid.  The  spacing  value  at  that  node  is 
computed  as  the  average  distance  to  its  two  neighboring 
nodes  on  the  boundary.  The  gradient  of  the  spacing  h  is 
evaluated  for  each  t  riangle  in  the  background  mesh  using 
Gauss1  Theorem, 


where  5  stands  for  the  surface  ami  s,  for  the  scaled  out¬ 
ward  normals  of  the  background  cell. 

A  linear  variation  between  the  fine  spacing  on  a  body 
and  the  coarse  spacing  on  a  far-ficld  boundary  is  ob¬ 
tained  when  the  background  triangle  connects  directly 
from  the  interior  to  the  exterior  boundary.  But  along 
concave  contours  it  may  happen  that  Delaunay  connects 
between  finely  spaced  interior  boundaries  and  the  back¬ 
ground  grid  will  specify  a  too  large  area  of  fine  spacing. 
Figure  1  gives  an  example  of  such  an  illconnected  back¬ 
ground  mesh.  It  shows  a  close-up  of  a  multi  element 
aerofoil,  obscured  by  the  triangles  formed  inside  the  ele¬ 
ments.  A  clearer  view  of  the  configuration  can  be  seen  in 
figure  11.  The  triangles  leaving  the  frame  are  connected 
to  the  outer  boundary.  However,  the  initial  triangulation 
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Also  connect*  the  finely  discretized  trailing  edge  of  the 
main  flap  with  the  lowei  side  of  the  main  aerofoil  and 
implies  an  undesirable  high  node  density  in  the  entire 
illconnccted  area  between  the  two  elements. 

The  problem  can  easily  be  circumvented  by  the  intro¬ 
duction  of  extra  nodes  into  the  background  mesh.  As  the 
background  triangulation  consists  of  very  skinny  trian¬ 
gles,  only  very  few  nodes  will  be  needed  to  break  up  the 
unwanted  connectivity.  If  we  connect  these  few  nodes  to 
a  boundary  in  the  background  grid  we  implicitly  define 
the  spacing  for  these  nodes  in  the  same  way  as  for  the 
other  boundaries.  However,  this  procedure  requires  extra 
user  input,  usually  after  viewing  the  background  grid  and 
visualizing  the  grid  becomes  ditlicult  in  three  dimensions, 
'lb  be  consistent  with  the  philosophy  of  minimal  user  in¬ 
put  we  should  have  the  program  introduce  the  necessary 
nodes  and  merely  ask  the  user  which  boundaries  he  docs 
not  want  to  have  connected.  The  procedure  will  be  to 
detect  an  illicit  liaison  and  place  a  background  node,  in- 
between.  During  a  subsequent  retriangulation  most  if  uot 
all  of  the  triangles  shared  between  the  two  bodies  will  be 
broken  and  again  few  extra  nodes  suffice  Figure  6  shows 
the  background  mesh  modified  by  automatic  insertion. 
Four  nodes  have  been  introduced  from  triangles  in  the 
area  between  the  main  aerofoil  ahd  the  main  flap. 


Fig.  6:  Background  grid  automatically  modified  by 
the  insertion  of  four  nodes  to  break  up  un¬ 
wanted  connectivity  Two  of  these  nodes  are 
shown  between  the  mam  aerofoil  and  the  sec¬ 
ond  flap. 


The  remaining  question  is  what  spacing  to  apply  to  these 
new  uodcs.  One  would  like  the  spacing  to  rise  smoothly 
from  every  body  node  into  the  domain  to  finally  match 
ihc  spacing  of  the  outer  boundary.  This  corresponds  to 
extrapolating  the  spacing  with  an  average  gradient  from 
every  boundary  node  towards  the  automatically  inserted 
node  and  take  the  minimum  of  all  these  values  -  an  un¬ 
reasonably  costly  procedure. 

Fortunately,  the  Delaunay  properties  make  the  task  at 
hand  a  lot  more  amenable.  If  we  place  the  new  node  to 
break  an  unwanted  triangle  at  the  Voronov  vertex  of  that 


ttianglc,  we  know  that  there  is  no  node  closer  to  the  new 
node  than  the  three  nodes  forming  that  triangle.  More¬ 
over,  the  new  node  is  equidistant  from  both  ill-connected 
boundaries.  We  then  extrapolate  from  the  spacing  of  the 
more  finely  discretized  boundary  using  the  average  gra¬ 
dient  of  the  initial  triangulation.  . 

3.5  Skewness  Threshold. 

So  far  the  term  'bad'  has  been  used  for  long  skinny  tri¬ 
angles  without  specifying  on  what  wc  base  this  label. 
From  the  previous  discussion  it  follows  that  a  criterion  is 
needed  that  is  easy  to  evaluate  and  that  discriminates  the 
faces  to  be  used  in  the  front.  An  obvious  and  inexpensive 
choice  for  quantifying  the  proportions  of  a  triangle  is  to 
look  at  ratios  of  the  squared  sidclcngth  over  the  squared 
maximum  sidclcngth.  A  triangle  will  be  called  ‘bad’  once 
any  of  its  three  side  ratios  drops  below  a  threshold.  Con¬ 
sidering  the  fact  that  a  triangle  is  formed  by  placing  a 
nod.;  somewhere  along  the  perpendicular  bisector  of  the 
base,  one  can  estimate  threshold  values  for  the  side  ratios. 
A  first  estimate  can  be  derived  for  an  acute  triangle  on 
a  zero-gradient  background.  In  this  case  Delaunay  trian- 
gulation  will  always  form  an  equilateral  explicit  triangle 
with  the  base,  and  an  implicit  triangle  with  the  new  node 
and  the  distant  node  of  the  previous  bad  triangle. 


b 


(»)  (b) 

Fig.  7:  (a)  Refining  of  an  acute  triangle  (dashed)  into 

an  equilateral  triangle  (bottom)  and  an  acute 
‘implicit’  triangle  (top);  (b)  Refining  of  an 
obtuse  triangle  (dashed)  into  two  triangles 
(full). 

Figure  7  (a)  shows  the  geometry  in  question.Tlie  worst 
'implicit'  triangle  is  produced  when  the  distant  node  of 
the  acute  triangle  also  lies  on  the  perpendicular  bisector. 
If  we  let  the  dashed  triangle  hccome  less  and  less  acute 
cv  will  grow  and  fl  will  become  smaller.  Both  ancles  will 
be  equal  if  6  ss  .64#,  so  that  refining  an  acute  triangle 
with  b  <  .64#  will  make  the  grid  worse,  Hence  for  acute 
triangles  a  good  threshold  is  the  side  ratio  of  a  triangle 
with  b  s  .64#  or 


Similar  reasoning  applies  to  the  obtuse  triangle  in  figure  7 
(b).  Here  the  two  nodes  formed  perpendicular  to  the  two 
short  faces  will  be  merged  subsequently  as  they  arc  too 
close  to  each  other.  Hence  we  have  to  consider  the  new 
no»le  to  be  placed  on  the  angular  bisector.  The  smallest 
angle  in  the  old  triangulation  was  «;  in  the  new  triangu¬ 
lation  it  is  0.  Since  2(or  -F/f)  =  ir  matters  only  improve  if 
a  <  k/4.  That  is,  wc  should  only  refine  obtuse  triangles 
for  which  the  side  ratio  is  less  than 
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It  turns  out  that  the  quality  of  the  triangulation  is  some¬ 
what  insensitive  to  the  choice  of  the  side  ratio  threshold 
and  any  value  in  the  range  of  the  two  estimates  will  give 
good  results.  This  allows  to  use  the  same  threshold  for 
both  obtuse  and  acute  triangles.  The  triangulation  will 
change  with  a  different  threshold  but  the  minimum  angles 
found  will  remain  virtually  unchanged.  Strong  gradients 
in  the  background  grid  might  lead  to  the  formation  of 
explicit  triangles  that  exceed  the  threshold  in  the  side 
ratio.  Therefore  the  altitude  of  the  triangle  to  be  formed 
will  be  bounded  to  the  height  of  an  obtuse  triangle  with 
the  threshold  side  length  ratio  and  the  height  of  its  acute 
counterpart. 

->m*n  \  ^  <  // Srnax  \  _  1^ 
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3.6  Spacing  Check. 

The  spacing  check  balances  the  mechanism  of  node  intro¬ 
duction  due  to  excessive  skewness  by  rejecting  or  merging 
nodes;  in  this  way  grid  quality  is  assured  for  the  implicit 
triangles.  We  might  want  to  reject  a  new  node  because  it 
falls  too  close  tho  some  existing  node,  or  because  it  falls 
too  close  to  another  new  node.  The  two  cases  have  to  be 
dealt  with  separately. 

Looking  for  close  nodes  that  are  already  introduced,  we 
can  make  use  of  the  fact,  that  a  Delaunay  mesh  con¬ 
structed  with  Bowycr’s  algorithm  covers  the  entile  con¬ 
vex  hull.  We  can  construct  an  enlarged  circle  around  each 
triangle  which  is  the  circumcirclc  with  an  added  rim  of 
the  required  distance  for  the  new  node.  If  the  new  node 
does  not  fall  within  that  enlarged  circle,  the  distance  be¬ 
tween  the  new  node  and  the  nodes  of  the  triangle  is  at 
leas,  t  :  required  spacing  (figure  8). 


Fig.  8:  Figure  8:  A  new  node  (A)  is  contained  in  the 
triangL  146,  but  closest  to  the  node  2.  The 
enlarged  circles  124,  132,  342  contain  A  and 
require  testing.  The  enlarged  ciicle  around 
456  does  not  contain  A  and  excludes  nodes  4, 
5  and  6  from  testing. 

In  a  similar  fashion  to  the  tree  search  during  the  inser¬ 
tion  procedure,  the  simply  connected  region  can  be  de¬ 
termined  where  nodes  that  are  already  introduced  might 
be  too  close  to  a  new  node.  Once  a  new  node  is  found  to 
be  too  close  to  another  old  one  it  Ls  discarded  from  the 
list. 


One  is  less  fortunate  with  checking  the  distance  towards 
the  other  new  nodes  that  arc  also  waiting  to  be  intro¬ 
duced.  Along  the  front  we  might  find  a  set  of  very  acute 
triangles  that  can  lie  rather  oblique  to  it  .  The  initial  front 
along  the  boundaries  in  figure  1  can  serve  as  a  good  exam¬ 
ple.  New  nodes  that  are  too  close  to  each  other  may  not 
lie  in  neighboring  triangles  and  one  cannot  make  use  of 
the  underlying  grid.  Extensive  search  throughout  the  list 
of  new  nodes  has  to  be  performed,  but  the  list  contains 
only  O(VN)  nodes  at  a  time.  This  advocates  the  use 
of  an  intelligent  data  structure  that  provides  some  kind 
of  bucketing  to  further  reduce  the  cost  of  searching  and 
will  retain  an  optimal  count  of  operations  of  0(N  log  N). 
Once  two  new  nodes  are  found  to  be  too  close,  they  are 
merged.  This  merging  is  actually  the  only  step  in  the  al¬ 
gorithm  that  introduces  irregularity  into  an  initially  reg¬ 
ular  mesh.  All  other  steps  arc  independent  of  the  order 
in  which  triangles  or  nodes  are  encountered.  While  it  is 
generally  not  important  which  neighbors  are  merged,  we 
do  want  to  have  preferred  merging  of  the  two  nodes  that 
are  formed  from  the  two  short  faces  of  an  obtuse  trian¬ 
gle  as  shown  in  figure  7  (b).  If  one  searches  backwards 
through  the  list  of  new  nodes,  this  pair  is  encountered 
first  and  treated  with  higher  priority. 

In  order  to  achieve  large  minimum  angles  we  may  toler¬ 
ate  nodes  being  closer  to  each  other  than  allowed  b>  the 
background  mesh.  Otherwise  the  skewness  mechanism 
providing  refinement  may  be  counterbalanced  too  much. 
Initially  the  number  of  new  nodes  is  completely  deter¬ 
mined  by  the  number  of  boundary  nodes.  As  the  front 
propagates  outwards,  the  nodes  will  eventually  become 
too  numerous  as  the  bv  ’'ground  mesh  demands  more 
and  more  distance  be*  ,1  the  nodes  and  the  spacing 
check  will  coarsen  the  In  this  way  liberal  spacing 

will  allow  more  completely  regular  rows  with  the  original 
number  of  nodes  around  the  bodies.  On  the  other  hand, 
being  too  lax  allows  node  insertion  where  no  improve¬ 
ment  can  be  achieved.  Good  values  for  a  tolerance  factor 
to  be  multiplied  with  the  desired  spacing  have  been  found 
to  lie  in  the  range  between  .5  and  .75. 

It  is  to  be  noted  that  no  criterion  for  too  large  cells  is 
needed  if  the  front  emerges  from  the  finely  spaced  bound¬ 
aries.  The  spacing  meek  tnism  will  gradually  coarsen  up 
that  front  until  it  meets  the  outer  boundary.  Further 
refinement  in  the  field  can  be  left  to  solution  adaptive  in¬ 
teraction  with  the  solver.  However,  the  implementation 
would  net  pose  any  problems.  If,  after  rctriangulation, 
two  connected  nodes  are  fo»  nd  to  be  too  far  apart,  an¬ 
other  node  can  be  introduced  between  them. 

3.7  Computational  Cost. 

Suppose  that  a  total  of  N  nodes  are  generated  in  such  a 
way  that  Np  new  nodes  arc  created  every  time  the  front 
advances.  Bowyer’s  algorithm  takes  0(log  N)  operations 
to  introduce  a  single  node  into  a  triangulation.  With 
a  dissecting  data  structure  like  a  split-tree  the  cost  of 
searching  the  list  of  NF  new  nodes  requires  O(log  NF) 
operations.  One  needs  N}~p  stages  to  construct  the  full 
triangulation;  the  total  cost  is  thus  0(N(p  +  1)  log  N). 
As  p  ranges  between  0  and  1,  the  number  of  operations 
necessary  increases  by  a  factor  of  two  in  the  worst  case 
when  all  nodes  are  formed  in  one  single  stage,  compared 
to  a  triangulation  of  specified  nodes.  Hence,  the  method 
is  asymptotically  optimal. 

Actual  times  are  given  for  the  three  element  aerofoil  case 
given  in  figures  9  through  12.  The  initial  triangjilation 
of  the  328  boundary  nodes  took  .77  seconds  on  a  DEC 
5000/200,  i.e,  .0023  scc/node.  The  insertion  algorithm 
created  2047  interior  nodes  and  used  27  seconds,  i.e  .013 
sec/node.  Thus,  the  method  for  generating  new  nodes 
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and  triangulating  them  costs  about  five  times  as  much 
computer  time  per  node  as  the  triangulation  alone.  Note 
that  the  current  implementation  docs  not  employ  any 
tree  data  structure  and  therefore  the  given  times  could 
be  reduced  further. 

4  EXAMPLES 

A  classic  case  for  an  unstructured  grid  generator  is  the 
grid  around  a  multi-element  aerofoil.  Structured  grid 
generation  already  rcquiti  s  sophisticated  extensions  to 
deal  with  this  problem.  The  background  grid  for  the 
aerofoil  given  in  figure  1  "as  modified  by  the  automatic 
insertion  of  four  nodes  as  seen  in  figure  6.  The  initial  Iri- 
angulatiou  is  boundary  conformal  and  does  not  require 
introduction  of  extra  nodes  according  Weatherill’s  apos- 
teriori  criterion  [1 1]. 


Fig.  9:  Grid  around  three  element  aerofoil  after  three 
rows  of  nodes  have  been  inserted.  Note  the 
coarsening  of  the  front  in  the  second  row  on 
the  upper  surface.  The  tracking  prevents  a 
breakdown  of  the  front  in  the  region  between 
the  three  aerofoils. 


Figure  9  shows  the  grid  after  three  rows  of  nodes  have 
been  constructed,  the  resulting  grid  is  shown  in  figure  10, 
a  close-up  of  the  aerofoil  in  figure  11.  The  different  rows 
of  nodes  can  be  identified  clearly  in  the  finai  triangula- 
tion. 

On  the  upper  surface  of  the  main  aerofoil  it  can  be 
demonstrated  how  essential  the  construction  algorithm 
is  to  grid  quality.  In  the  second  row  the  spacing  check 
has  eliminated  several  nodes  and  the  length  of  the  faces 
in  the  new  front  varies  from  normal  to  doubled.  Still 
the  nodes  in  the  third  row  are  aligned  evenly,  providing 
nearly  equilateral  triangles  again.  The  disturbances  in¬ 
troduced  in  the  second  row  are  completely  eliminated  in 
the  fourth  row, 

The  regularity  of  the  grid  is  entirely  due  to  the  frontal 
insertion,  no  smoothing  filter  .yus  applied.  Figure  12 
shows  a  detail  of  the  grid  between  the  three  elements. 
The  fronts  do  not  break  down  and  merge  into  each  other 
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Fig.  10:  Grid  around  thrcc-elcment  aerofoil. 


Fig.  11:  Close-up  of  grid  around  three-element  aero¬ 
foil. 


Fig.  12:  Detail  of  grid  around  three-element  aerofoil. 


smoothly.  Only  very  few  triangles  with  maximum  an¬ 
gles  exceeding  90°  can  be  found.  If  fronts  are  aligned 
to  each  other,  the  resulting  point  cloud  is  perfectly  reg¬ 
ular  as  between  the  main  (lap  and  the  vane  flap.  The 
gradual  increase  in  node  spacing  between  the  main  aero¬ 
foil  and  the  main  flap  is  due  to  the  automatic  insertion 
of  additional  background  nodes  which  are  not  present  in 
the  foreground  grid.  Overall,  the  cell  surface  varies  very 
smoothly  with  a  (actor  of  about  100  000  from  the  small¬ 
est  cells  at  the  trailing  edge  of  the  vane  flap  to  the  largest 
cells  at  the  outer  boundary. 

The  only  user  input  for  the  case  were  the  328  boundary 
nodes  and  a  statement  requiring  no  connection  between 
the  main  aerofoil  and  the  main  (lap. 


5  CONCLUSIONS 

A  frontal  mechanism  for  the  creation  of  the  interior  nodes 
of  a  Delaunay  triangulation  has  been  developed.  The 
method  combines  the  high  node  distribution  quality  of 
the  Advancing  Front  method  with  the  optimal  connec¬ 
tivity  of  the  Delaunay  triangulation.  Precise  control  of 
node  spacing  is  achieved  by  the  use  of  the  initial  trian- 
gnlation  of  the  boundary  nodes  as  a  background  mesh 
with  nc  additional  effort  of  the  user.  The  node  genera¬ 
tion  does  not  require  explicit  tracking  of  the  front  and  is 
independent  of  the  oracr  in  which  triangles  arc  listed. 

We  arc  presently  working  on  a  generalization  that  can  in¬ 
corporate  stretching  to  obtain  a  non-isotropic  triaugula- 
lion.  All  features  of  this  concept  readily  extend  to  three 
dimensions  where  the  optimal  operation  count  and  the 
simplicity  of  boundary  control,  front  tracking  and  node 
construction  of  the  method  become  even  more  important. 
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14.  Abstract 

Lecture  notes  for  the  AGARD  Fluid  Dynamics  Panel  (FDP)  Special  Course  on  “Unstructured 
Grid  Methods  for  Advcction  Dominated  Flows”  have  been  assembled  in  this  report.  The 
objective  of  this  course  was  to  provide  state-of-the-art  information,  as  well  as  recent 
developments  in  unstructured  grid  methods,  suitable  for  the  computation  of  high  Reynolds 
number  compressible  and  incompressible  flows,  and  other  related  subjects.  Specifically,  topics 
and  methods  covered  include: 

—  L.cast  Squares  Galerkin  and  Streamline  Diffusion  Finite  Element  Methods 

—  Finite  Volume  Methods  and  Higher  Order  Polynomial  Reconstruction 

—  Essentially  Non  Oscillatory  Schemes  for  Unstructured  Grids 

—  Multidimensional  Upwind  Schemes  on  Triangles  and  Tetrahedra 

—  Grid  Generation  Methods  for  Unstructured  Grids  Using  the  Frontal  Method  and  Delaunay 
Principle 

—  Turbulence  Mod^’icjon  Unstructured  Grids 

—  Error  Estimato  "soiuiic^  Adaptivity 

—  Parallel  Compute  on  Unstructured  Grid 

—  Post  Processing  Unstructured  Grid  Data  Bases  for  Flow  Visualization  Analysis 

A  wide  range  of  applications  is  presented,  which  includes  incompressible  free  surface  problems, 
transonic  aerodynamics,  and  hypersonic  reentry  flows. 

The  material  assembled  in  this  report  was  prepared  and  presented  under  the  combined 
sponsorship  of  the  AGARD  Fluid  Dynamics  Panel,  the  Consultant  and  Exchange  Programme  of 
AGARD,  and  the  von  Karman  Institute  (VK1)  for  Fluid  Dynamics.  It  was  presented  as  an 
AGARD-FDP-VKI  Special  Course  at  the  VKI,  Rhode-Saint-Genese,  Belgium,  1 8th— 22nd  May 
1992  and  at  the  NASA  Ames  Research  Center,  Moffett  Field,  California,  United  States,  28th 
September— 2nd  October  1 992. _ 
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